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METHODS AND COMPOSITIONS FOR INHIBITION OF MEMBRANE 
FUSION- ASSOCIATED EVENTS, INCLUDING HIV TRANSMISSION 

5 

1. INTRODUCTION 
The present invention relates, first, to DP178 
(SEQ ID NO;l), a peptide, also referred to herein as 
T20, corresponding to amino acids 638 to 673 of the 
HIV-ljA, transmembrane protein (TM) gp41, and portions 
or analogs of DP178 (SEQ ID NO:l), which exhibit anti- 
membrane fusion capability, antiviral activity, such 
as the ability to inhibit HIV transmission to 
uninfected CD-4* cells, or an ability to modulate 
intracellular processes involving coiled-coil peptide 

15 structures. The present invention also relates -to 

peptides analogous to DP107 (SEQ ID NO:25) , a peptide, 
which is also referred to herein as T21, corresponding 
to amino acids 558 to 595 of the HIV-l^ transmembrane 
protein (TM) gp41, having amino acid sequences present 

20 in other viruses, such as enveloped viruses, and/or 
other organisms, and further relates to the uses of 
such peptides. These peptides exhibit anti-membrane 
fusion capability, antiviral activity, or the ability 
to modulate intracellular processes involving coiled- 
coil peptide structures. 

25 

The gp41 region from which DP107 is derived is 
referred to herein as HR1 . The gp41 region from which 
DP17 8 is derived is referred to herein as HR2 . As 
discussed herein, the gp41 HR1 and HR2 regions 
interact (non-covalently) with each other and/or with 
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T20 and T21 peptides. This interaction is required 
for normal infectivity of HIV. 

The present invention therefore additionally 
relates to methods for identifying compounds, 
including small molecule compounds, that disrupt the 
5 interaction between DP178 and DP107, and/or between 
DP107-like and DP178-like peptides. In one 
embodiment, such methods relate to identification and 
utilization of modified DP178, DP178~like, DP107 and 
DP107-like peptides and peptide pairs that interact 

10 with each other at a lower affinity than the affinity 
exhibited by corresponding "parent" or "native" 
peptides. Further, the invention relates to the u?e 
of DP178, DP178 portions, DP107, DP017 portions aii#/br 
analogs and other modulators, including small 
molecules modulators, of DP178/DP107, 
DP178-like/DP107-like, or HR1/HR2 interactions as 
antifusogenic or antiviral compounds or as inhibitors 
of intracellular events involving coiled-coil peptide 
structures. The invention is . demonstrated, first , by 
way of an Example wherein DP178 (SEQ ID:1), and a 

20 peptide whose sequence is homologous to DP178 are each 
shown to be potent, non-cytotoxic inhibitors of HIV-1 
transfer to uninfected CD-4 + cells. The invention is 
further demonstrated by Examples wherein peptides 
having structural and/or amino acid motif similarity 

25 to DP107 and DP178 are identified in a variety of 

viral and nonviral organisms, and in examples wherein 
a number of such identified peptides derived from 
several different viral systems are demonstrated to 
exhibit antiviral activity. The invention is still 

30 further demonstrated by way of Examples wherein other 
DPl78-like and DPl07-like peptides are identified that 
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interact with their corresponding HR1 and HR2 domains 
with a lower affinity than the affinity exhibited by 
the native DP178 or DP107 peptide from which they are 
derived. 

2. BACKGROUND OF THE INVENTION 
2.1 MEMBRANE FUSION EVENTS 
Membrane fusion is a ubiquitous cell biological 
process (for a review, see White, J.M., 1992, Science 
258 : 917-924) . Fusion events which mediate cellular 
housekeeping functions, such as endocytosis, 
constitutive secretion, and recycling of membrane 
components, occur continuously in all eukaryotic 
cells . 

Additional fusion events occur in specialized 
cells. Intracellularly, for example, fusion events 
are involved in such processes as occur in regulated 
exocytosis of hormones, enzymes and neurotransmitters. 
Intercellularly, such fusion events feature 
prominently in, for example, sperm-egg fusion and 
myoblast fusion. 

Fusion events are also associated with disease 
states. For example, fusion events are involved in 
the formation of giant cells during inflammatory 
reactions, the entry of all enveloped viruses into 
cells, and, in the case of human immunodeficiency 
virus (HIV) , for example, are responsible for the 
virally induced cell-cell fusion which leads to cell 
death, 

2.2. THE HUMAN IMMUNODEFICIENCY VIRUS 
The human immunodeficiency virus (HIV) has been 
implicated as the primary cause of the slowly 
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degenerative immune system disease termed acquired 
immune deficiency syndrome (AIDS) (Barre-Sinoussi, F. 
et al . , 1983, Science 220 :868-870; Gallo, R. et al . , 
1984, Science 224 ; 500-503) . There are at least two 
distinct types of HIV: HIV-1 (Barre-Sinoussi, F. et 
5 al. , 1983, Science 220:868-870; Gallo R. et al . , 1984, 
Science 224:500-503) and HIV-2 (Clavel, F. et al. , 
1986, Science 231:343-346; Guyader, M. et al . , 1987, 
Nature 3^6:662-669) . Further, a large amount of 
genetic heterogeneity exists within populations of 

10 each of these types. Infection of human CD-4+ T- 
lymphocytes with an HIV virus leads to depletion of 
the cell type and eventually to opportunistic 
infections, neurological dysfunctions, neoplastic 
growth, and ultimately death. 

HIV is a member of the lentivirus family of 
retroviruses (Teich, N. et al. , 1984, RNA Tumor 
Viruses, Weiss, R. et al. , eds., CSH-Press, pp. 949- 
956) . Retroviruses are small enveloped viruses that 
contain a diploid, single -stranded RNA genome, and 
replicate via a DNA intermediate produced by a 

20 virally-encoded reverse transcriptase, an RNA- 

dependent DNA polymerase (Varmus, H. , 1988, Science 
24_0-' 1427-1439) . Other retroviruses include, for 
example, oncogenic viruses such as human T-cell 
leukemia viruses (HTLV-I, -II, -III) , and feline 

25 leukemia virus. 

The HIV viral particle consists of a viral core, 
composed of capsid proteins, that contains the viral 
RNA genome and those enzymes required for early 
replicative events. Myristylated Gag protein forms an 

3q outer viral shell around the viral core, which is, in 
turn, surrounded by a lipid membrane enveloped derived 
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from the infected cell membrane. The HIV enveloped 
surface glycoproteins are synthesized as a single 160 
Kd precursor protein which is cleaved by a cellular 
protease during viral budding into two glycoproteins, 
gp4l and gpl20. gp4l is a transmembrane protein and 
5 gpl20 is an extracellular protein which remains non- 
covalently associated with gp41, possibly in a 
trimeric or multimeric form (Hammarskjold, M. and 
Rekosh, D., 1989, Biochem. Biophys. Acta 989:269-280) . 
HIV is targeted to CD-4 + cells because the CD-4 
10 cell surface protein acts as the cellular receptor for 
the HIV-1 virus (Dalgleish, A. et al . , 1984, Nature 
312:763-767; Klatzmann et al . , 1984, Nature 312:767- 
768; Maddon et al . , 1986, Cell 47 : 333-348) . Viral " 
entry into cells is dependent upon gpl20 binding the 
cellular CD-4 + receptor molecules (McDougal, J.S._ et 
al . , 1986, Science 231:382-385; Maddon, P.J. et al . , 
1986, Cell 47:333-348) and thus explains HIV's tropism 
for CD-4 + cells, while gp41 anchors the enveloped 
glycoprotein complex in the viral membrane. 



15 



20 2.3. HIV TREATMENT 

HIV infection is pandemic and HIV associated 
diseases represent a major world health problem. 
Although considerable effort is being put into the 
successful design of effective therapeutics, currently 

25 no curative ant i -retroviral drugs against AIDS exist. 
In attempts to develop such drugs, several stages of 
the HIV life cycle have been considered as targets for 
therapeutic intervention (Mitsuya, H. et al. , 1991, 
FASEB J. 5:2369-2381) . For example, virally encoded 
reverse transcriptase has been one focus of drug 
development. A number of reverse-transcriptase- 
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targeted drugs, including 2', 3'-dideoxynucleoside 

analogs such as AZT, ddl, ddc, and d4T have been 

developed which have been shown to been active against 

HIV (Mitsuya, H. et al. , 1991, Science 2£9: 1533-1544) . , 

While beneficial, these nucleoside analogs are not 

5 curative, probably due to the rapid appearance of drug 

resistant HIV mutants (Lander, B. et al. , 1989, 

Science 243:1731-1734) . In addition, the drugs often 

exhibit toxic side effects such as bone marrow 

suppression, vomiting, and liver function 

10 abnormalities. 

Attempts are also being made to develop drugs 

which can inhibit viral entry into the cell, the 

earliest stage of HIV infection. Here, the focus has" 

thus far been on CD4, the cell surface receptor for 

HIV. Recombinant soluble CD4; for example, has been 
15 * 
shown to inhibit infection of CD-4* T-cells by some 

HIV-1 strains (Smith, D.H. et al . , 1987, Science 

238 : 1704-1707) . Certain primary HIV-1 isolates, 

however, are relatively less sensitive to inhibition 

by recombinant CD-4 (Daar, E. et al . , 1990, Proc. 

20 Natl. Acad. Sci. USA £7: 6574-6579) . In addition, 

recombinant soluble CD-4 clinical trials have produced 
inconclusive results (Schooley, R. et al . , 1990, Ann. 
Int. Med. 112:247-253; Kahn, J.O. et al. , 1990, Ann. 
Int. Med, 112:254-261; Yarchoan, R. et al . , 1989, 

25 Proc. Vth Int. Conf. on AIDS, p. 564, MCP 137). 

The late stages of HIV replication, which involve 
crucial virus-specific secondary processing of certain 
viral proteins, have also been suggested as possible 
anti-HIV drug targets. Late stage processing is 
dependent on the activity of a viral protease, and 
drugs are being developed which inhibit this protease 
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(Erickson, J., 1990, Science 249 : 527-533) . The 
clinical outcome of these candidate drugs is still in 
question. 

Attention is also being given to the development 
of vaccines for the treatment of HIV infection. The 
5 HIV-1 enveloped proteins (gpl60, gpl20, gp41) have 
been shown to be the major antigens for anti-HIV 
antibodies present in AIDS patients , (Barin, et al. , 
1985, Science 228 : 1094-1096) . Thus far, therefore, 
these proteins seem to be the most promising 

20 candidates to act as antigens for anti-HIV vaccine 
development. To this end, several groups have begun 
to use various portions of gpl60, gpl2 0, and/or gp41 
as immunogenic targets for the host immune system. 
See for example, Ivanoff,. L. et al. , U.S. Pat. No. 
5,141,867; Saith, G. et al . , WO 92/22,654; Shafferman, 
A. , WO 91/09,872; Formoso, C. et al, , WO 90/07,119. 
Clinical results concerning these candidate vaccines, 
however, still remain far in the future. 

Thus, although a great deal of effort is being 
directed to the design and testing of anti- retroviral 

20 drugs, a truly effective, non- toxic treatment is still 
needed. 



3. SUMMARY OF THE INVENTION 
2 5 The present invention relates, first, to DP178, a 

3 6 -amino acid synthetic peptide, also referred to 
. herein as T20, corresponding to amino acids 63 8 to 673 
of the transmembrane protein (TM) gp41 from the HIV-1 
isolate LAI (HIV-l^) , which exhibits potent anti- 
^ HIV-1 activity. The gp41 region from which DP178 is 
derived in referred to herein as HR2 . 
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The invention further relates to those portions 
and analogs of DP178 which also show such antiviral 
activity, and/or show anti-membrane fusion capability, 
or an ability to modulate intracellular processes 
involving coiled-coil peptide structures. The term 
5 "DP178 analog" refers to a peptide which contains an 
amino acid sequence corresponding to the DP178 peptide 
sequence present within the gp41 protein of HIV-l^j, 
but found in viruses and/or organisms other than HIV- 
Ijju. Such DP178 analog peptides may, therefore, 
10 correspond to DP178-like amino acid sequences present 
in other viruses, such as, for example, enveloped 
viruses, such as retroviruses other than HIV-l^, as 
well as non-enveloped viruses. Further, such 
analogous DP178 peptides may also correspond to DP178- 
like amino acid sequences present in nonviral 
organisms . 

The invention further relates to DP107, a 
peptide, which is also referred to herein as T21, 
corresponding to amino acids 558-595 of the HIV-1^ 
transmembrane protein (TM) gp41. The gp41 region from 

20 which DP107 is derived is referred to herein as HR1. 
The invention also relates to those portions and 
analogs of DP107 which that also show antiviral 
activity, and/or show anti-membrane fusion capability, 
or an ability to modulate intracellular processes 

25 involving coiled-coil peptide structures. The term 
"DP107 analog" as used herein refers to a peptide 
which contains an amino acid sequence corresponding to 
the DP107 peptide sequence present within the gp41 
protein of HIV-1^, but found in viruses and organisms 

3q other than HIV-1^. Such DP107 analog peptides may, 
therefore, correspond to DP107-like amino acid 
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sequences present in other viruses, such as, for for 
example, enveloped viruses, such as retroviruses other 
than HIV-l^j, as well as non-enveloped viruses. 
Further, such DP107 analog peptides may also 
correspond to DP107-like amino acid sequences present 
5 in nonviral organisms. 

Further, the peptides of the invention include 
DP107 analog and DP178 analog peptides having amino 
acid sequences recognized or identified by the 
107x178x4, ALLMOTI5 and/or PLZIP search motifs 

10 described herein. 

The peptides of the invention may, for example, 
exhibit antifusogenic activity, antiviral activity, 
and/or may have the ability to modulate intracellular* 
processes which involve coiled-coil peptide 
structures. With respect to the antiviral activity of 
the peptides of the invention, such an antiviral 
activity includes, but is not limited to the 
inhibition of HIV transmission to uninfected CD-4 + 
cells. Additionally, the antifusogenic capability, 
antiviral activity or intracellular modulatory 

20 activity of the peptides of the invention merely 
requires the presence of the peptides of the 
invention, and, specifically, does not require the 
stimulation of a host immune response directed against 
such peptides. 

25 The peptides of the invention may be used, for 

example, as inhibitors of membrane fusion-asociated 
events, such as, for example, the inhibition of human 
and non-human retroviral, especially HIV, transmission 
to uninfected cells. It is further contemplated that 
the peptides of the invention may be used as 
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modulators of intracellular events involving coiled- 
coil peptide structures. 

The peptides of the invention may, alternatively, 
be used to identify compounds, including small 
molecule compounds, which may themselves exhibit 
5 antifusogenic, antiviral, or intracellular modulatory 
activity. For example, in one embodiment, the 
peptides of the invention are used to identify other 
DP178-like and/or DP107-like peptides that interact 
with each other and/or with their complementary HR1 or 

10 HR2 domains with a lower affinity than the affinity 
exhibited by the "parent" or "native" DP178 or DP107 
peptides from which they are derived. Such DP178-like 
and DP107-like peptides, which are also part of the " 
present invention, may also be used, e.g., to identify 

^ compounds, such as small molecule compounds, that 
exhibit antifusogenic, antiviral, or intracellular 
modulatory activity. 

Additional uses include, for example, the use of 
the peptides of the invention as organism or viral 
type and/or subtype- specif ic diagnostic tools. 

20 

The terms "antifusogenic" and " ant i- membrane 
fusion", as used herein, refer to an agent* s ability 
to inhibit or reduce the level of membrane fusion 
events between two or more moieties relative to the 
level of membrane fusion which occurs between said 

25 moieties in the absence of the peptide. The moieties 
may be, for example, cell membranes or viral 
structures, such as viral envelopes or pili. The term 
"antiviral", as used herein, refers to the compound's 
ability to inhibit viral infection of cells, via, for 

3Q example, cell-cell fusion or free virus infection. 

Such infection may involve membrane fusion, as occurs 
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in the case of enveloped viruses, or some other fusion 
event involving a viral structure and a cellular 
structure ( e.g. , such as the fusion of a viral pilus 
and bacterial membrane during bacterial conjugation) . 
It is also contemplated that the peptides of the 
5 invention may exhibit the ability to modulate 

intracellular events involving coiled-coil peptide 
structures. "Modulate", as used herein, refers to a 
stimulatory or inhibitory effect on the intracellular 
process of interest relative to the level or activity 
10 of such a process in the absence of a peptide of the 
invention. 

Embodiments of the invention are demonstrated 
below wherein an extremely low concentration of DP17B" 
(SEQ ID:1), and very low concentrations of a DP178 

^ homolog (SEQ ID: 3) are shown to be potent inhibitors 
of HIV-l mediated CD-4 + cell-cell fusion ( i.e. , 
syncytial formation) and infection of CD-4* cells by 
cell-free virus. Further, it is shown that DP178 (SEQ 
ID:1) is not toxic to cells, even at concentrations 3 
logs higher than the inhibitory DP-178 (SEQ ID:1) 

20 concentration. 

The present invention is based, in part, on the 
surprising discovery that the DP107 and DP178 domains 
of the HIV gp41 protein non-covalently complex with 
each other, and that their interaction is required for 

25 the normal infectivity of the virus. This discovery 
is described in the Example presented, below, in 
Section 8. The invention, therefore, further relates 
to methods for identifying antif usogenic, including 
antiviral, compounds that disrupt the interaction 

3q between DP107 and DP178, and/or between DP107-like and 
DP178-like peptides. 
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Additional embodiments of the invention 
(specifically, the Examples presents in Sections 9-16 
and 19-25, below) are demonstrated, below, wherein 
peptides, from a variety of viral and nonviral 
sources, having structural and/or amino acid motif 
similarity to DP107 and DP178 are identified, and 
search motifs for their identification are described. 
Further, Examples (in Sections 17, 18, 25-29) are 
presented wherein a number of the peptides of the 
invention are demonstrated exhibit substantial 
antiviral activity or activity predictive of antiviral 
activity. 

3.1. DEFINITIONS 

Peptides are defined herein as organic compounds 
comprising two or more amino acids covalently joined 
by peptide bonds. Peptides may be referred to with 
respect to the number of constituent amino acids, 
i.e. , a dipeptide contains two amino acid residues, a 
tripeptide contains three, etc. Peptides containing 
ten or fewer amino acids may be referred to as 
oligopeptides, while those with more than ten amino 
acid residues are polypeptides. Such peptides may 
also include any of the modifications and additional 
amino and carboxy groups as are described herein. 

Peptide sequences defined herein are represented 

by one-letter symbols for amino acid residues as 

follows : 

A (alanine) 

R (arginine) 

N (asparagine) 

D (aspartic acid) 

C (cysteine) 

Q (glutamine) 

E (glutamic acid) 
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G (glycine) 
H (histidine) 
I (isoleucine) 
L (leucine) 
K (lysine) 
M (methionine) 
F (phenylalanine) 
5 P (proline) 
S (serine) 
T (threonine) 
W (tryptophan) 

Y (tyrosine) 

V (valine) 



4 . BRIEF DESCRIPTION OF THE FIGURES 

10 

FIG. 1. Amino acid sequence of DP178 (SEQ ID:1) 
derived from HIV^; DP178 homologs derived from HIV-1 SF2 
(DP-185; SEQ ID:3), HIV-1 RF (SEQ ID:4), and HIV- l m (SEQ 
ID: 5); DP178 homologs derived from amino acid 
sequences of two prototypic HIV-2 isolates, namely, 

15 HIV-2 rod (SEQ ID:6) and HIV-2 NIH2 (SEQ ID:7) ; control 

peptides: DP-180 (SEQ ID:2), a peptide incorporating 
the amino acid residues of DP178 in a scrambled 
sequence; DP-118 (SEQ ID:10) unrelated to DP178, which 
inhibits HIV-l cell free virus infection; DP-125 (SEQ 

20 ID:8), unrelated to DP178, also inhibits HIV-l cell 
free virus infection; DP-116 (SEQ ID: 9), unrelated to 
DP178, is negative for inhibition of HIV-l infection 
when tested using a cell -free virus infection assay. 
Throughout the figures, the one letter amino acid code 
is used. 

25 

FIG. 2. Inhibition of HIV-l cell-free virus 
infection by synthetic peptides. IC S0 refers to the 
concentration of peptide that inhibits RT production 
from infected cells by 50% compared to the untreated 
control. Control: the level of RT produced by 

30 
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untreated cell cultures infected with the same level 
of virus as treated cultures. 

FIG. 3. Inhibition of HIV-1 and HIV-2 cell-free 
virus infection by the synthetic peptide DP178 (SEQ 
ID:1) . IC S0 : concentration of peptide that inhibits 
5 RT production by 50% compared to the untreated 

control. Control: Level of RT produced by untreated 
cell cultures infected with the same level of virus as 
treated cultures. 

FIG. 4A-4B. Fusion Inhibition Assays. FIG 4A: 
10 DP178 (SEQ ID:1) inhibition of HIV-1 prototypic 

isolate-mediated syncytial formation; data represents 
the number of virus-induced syncytial per cell. FIG. 
4B: DP- 180 (SEQ ID: 2) represents a scrambled control* 
peptide; DP- 185 (SEQ ID: 3) represents a DP178 homolog 
derived from HIV-1 SF2 isolate; -Control, refers to the 
number of syncytial produced in the absence of 
peptide . 

FIG. 5. Fusion inhibition assay: HIV-1 vs. 
HIV-2. Data represents the number of virus -induced 
syncytial per well. ND: not done. 

20 FIG. 6. Cytotoxicity study of DP178 (SEQ ID:1) 

and DP-116 (SEQ ID:9) on CEM cells. Cell 
proliferation data is shown. 

FIG. 7. Schematic representation of HIV-gp41 
and maltose binding protein (MBP) -gp41 fusion 

25 proteins. DP107 and DP178 are synthetic peptides 

based on the two putative helices of gp4l. The letter 
P in the DP107 boxes denotes an lie to Pro mutation at 
amino acid number 578. Amino acid residues are 
numbered according to Meyers et al . , "Human 
Retroviruses and AIDS", 1991, Theoret . Biol, and 
Biophys. Group, Los Alamos Natl. Lab., Los Alamos, NM. 



- 14 - 



WO 01/51673 



PCT/US00/35727 



The proteins are more fully described, below, in 
Section 8.1.1. 

FIG. 8. A point mutation alters the 
conformation and anti-HIV activity of M41. 

FIG. 9. Abrogation of DP178 anti-HIV activity. 
5 Cell fusion assays were carried out in the presence of 
10 nM DP178 and various concentrations of M41A178 or 
M41PA178. 

FIG. 10. Binding of DP178 to leucine zipper of 
gp41 analyzed by FAb-D ELISA. 

10 FIG. 11A-B. Models for a structural transition 

in the HIV-1 TM protein. Two models are proposed 
which indicate a structural transition from a native 
oligomer to a fusogenic state following a trigger 
event (possibly gpl20 binding to CD4) . Common 
features of both models include (1) the native state 
is held together by noncovalent protein-protein 
interactions to form the heterodimer of gpl2 0/41 and 
other interactions, principally though gp41 
interactive sites, to form homo -oligomers on the virus 
surface of the gpl20/41 complexes; (2) shielding of 

20 the hydrophobic fusogenic peptide at the N- terminus 
(F) in the native state; and (3) the leucine zipper 
domain (DP107) exists as a homo-oligomer coiled coil 
only in the fusogenic state. The major differences in 
the two models include the structural state (native or 

25 fusogenic) in which the DP107 and DP178 domains are 
complexed to each other. In the first model (FIG. 
11A) this interaction occurs in the native state and 
in the second (FIG. 11B) , it occurs during the 
fusogenic state. When triggered, the fusion complex 
in the model depicted in (A) is generated through 

30 

formation of coiled-coil interactions in homologous 
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DP107 domains resulting in an extended a-helix. This 
conformational change positions the fusion peptide for 
interaction with the cell membrane. In the second 
model (FIG. 11B) , the fusogenic complex is stabilized 
by the association of the DP178 domain with the DP107 
5 coiled-coil. 

FIG. 12. Motif design using heptad repeat 
positioning of amino acids of known coiled-coils. 

FIG. 13. Motif design using proposed heptad 
repeat positioning of amino acids of DP107 and DP178. 
10 FIG. 14. Hybrid motif design crossing GCN4 

and DP107. 

FIG. 15. Hybrid motif design crossing GCN4 

and DP178. 

FIG . 16. Hybrid motif design 107x178x4, 
crossing DP107 and DP178. This motif was found to be 

15 

the most consistent at identifying relevant DP107-like 
and DP178-like peptide regions. 

FIG. 17. Hybrid motif design crossing GCN4 , 
DP107, and DP178. 

FIG. 18. Hybrid motif design ALLMOTI5 
20 crossing GCN4 , DP107, DP178, c-Fos c-Jun, c-Myc, and 
Flu Loop 3 6 . 

FIG. 19. PLZIP motifs designed to identify 
N- terminal proline -leucine zipper motifs. 

FIG. 20. Search results for HIV-1 (BRU 
25 isolate) enveloped protein gp41. Sequence search 

motif designations: Spades (±) : 107x178x4; Hearts (V) 
ALLM0TI5; Clubs (*} : PLZIP; Diamonds (♦) : 
transmembrane region (the putative transmembrane 
domains were identified using a PC/Gene program 
designed to search for such peptide regions) . 
Asterisk (*) : Lupas method. The amino acid sequences 
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identified by each motif are bracketed by the 
respective characters. Representative sequences 
chosen based on 107x178x4 searches are underlined and 
in bold. DP107 and DP178 sequences are marked, and 
additionally double-underlined and italicized. 
5 FIG. 21. Search results for human 

respiratory syncytial virus (RSV) strain A2 fusion 
glycoprotein Fl. Sequence search motif designations 
are as in FIG. 20. 

FIG. 22. Search results for simian 
10 immunodeficiency virus (SIV) enveloped protein gp41 
(AGM3 isolate) . Sequence search motif designations 
are as in FIG. 20 . 

FIG. 23. Search results for canine 

distemper virus (strain Onderstepoort) fusion 

glycoprotein l. Sequence search motif designations 
15 * 
are as in FIG. 20. 

FIG. 24. Search results for newcastle 
disease virus (strain Australia-Victoria/32) fusion 
glycoprotein Fl. Sequence search motif designations 
are as in FIG. 20. 
20 FIG. 25. Search results for human 

parainfluenza 3 virus (strain NIH 47885) fusion 
glycoprotein Fl. Sequence search motif designations 
are as in FIG. 20. 

FIG. 26. Search results for influenza A 
25 virus (strain A/AICHI/2/68) hemagglutinin precursor 
HA2. Sequence search designations are as in FIG. 20. 

FIG. 27A-D. Respiratory Syncytial Virus 
(RSV) peptide antiviral and circular dichroism data. 
FIG. 27A-B: Peptides derived from the F2 DP178/DP107- 
like region. Antiviral and CD data. FIG. 2 7C-D: 
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Peptides derived from the Fl DP107-like region. 
Peptide and CD data. 

Antiviral activity (AV) is represented by the 
following qualitative symbols: 

, negative antiviral activity; 
5 "+/-", antiviral activity at greater than 

10 0/xg/ml; 

'» + ", antiviral activity at between 50-100/ig/ml; 
"++", antiviral activity at between 20-50/zg/ml; 
"+++", antiviral activity at between 1-20/zg/ml; 
10 "++++", antiviral activity at <l/zg/ml. 

CD data, referring to the level of helicity is 
represented by the following qualitative symbol: 
, no helicity; 
" + ", 25-50% helicity; 

50-75% helicity; 
n +++ „, 75 . 100 % helicity. 

IC 50 refers to the concentration of peptide 
necessary to produce only 50% of the number of 
syncytial relative to infected control cultures 
containing no peptide. IC S0 values were obtained using 

20 purified peptides only. 

FIG. 2 8A-B. Respiratory Syncytial Virus 
(RSV) DP178-like region (Fl) peptide antiviral and CD 
data. Antiviral symbols, CD symbols, and IC S0 are as 
in FIG. 27A-D. IC 50 values were obtained using 

25 purified peptides only. 

FIG. 29A-B. Peptides derived from the HPIV3 
Fl DPl07-like region. Peptide antiviral and CD data. 
Antiviral symbols, CD symbols, and IC 50 are as in FIG. 
27A-D. Purified peptides were used to obtain IC 50 
values, except where the values are marked by an 
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asterisk (*) , in which cases, the IC 50 values were 
obtained using a crude peptide preparation. 

FIG. 2 9C. HPIV3 peptide T-184 CD spectrum 
at 1°C in 0.1M NaCl lOmM KP0 4 , pH 7.0. The data 
demonstrates the peptide's helical secondary structure 
5 (0222/2oa =:1 - 2 ) over a wide range of concentrations (100- 
1500^M) . This evidence is consistent with the peptide 
forming a helical coiled- coil structure. 

FIG. 3 0A-B. Peptides derived from the HPIV3 
Fl DPl78-like region. Peptide antiviral and CD data. 
!0 Antiviral symbols, CD symbols, and IC 50 are as in FIG. 
2 7A-D. Purified peptides were used to obtain IC S0 
values, except where the values are marked by an 
asterisk (*) , in which cases, the IC 50 values were 
obtained using a crude peptide preparation. 

FIG. 31. Motif search results for simian 

15 

immunodeficiency virus (SIV) isolate MM251, enveloped 
polyprotein gp41. Sequence search designations are as 
in FIG. 20 . 

FIG. 32. Motif search results for Epstein- 
Barr Virus (Strain B95-8) , glycoprotein gpllO 
20 precursor (designated gpllS) . BALF4 . Sequence search 
designations are as in FIG. 20. 

FIG. 33. Motif search results for Epstein- 
Barr Virus (Strain B95-8) , BZLF1 trans -activator 
protein (designated EB1 or Zebra) . Sequence search 
25 designations are as in FIG. 20. Additionally, n @ tt 
refers to a well known DNA binding domain and 11 + " 
refers to a well known dimerization domain, as defined 
by Flemington and Speck (Flemington, E. and Speck, 
S.H., 1990, Proc. Natl. Acad. Sci. USA 87:9459-9463). 

FIG. 34. Motif search results for measles 

30 
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virus (strain Edttionston) , fusion glycoprotein Pi. 
Sequence search designations are as in FIG. 20. 

FIG. 35. Motif search results for Hepatitis 
B Virus (Subtype AYW) , major surface antigen precursor ^ 
S. Sequence search designations are as in FIG. 20. 
5 FIG. 36. Motif search results for simian 

Mason-Pfizer monkey virus, enveloped (TM) protein 
gp20. Sequence search designations are as in FIG. 20. 

FIG. 37. Motif search results for 
Pseudomonas aerginosa, f imbrial protein (Pilin) . 
10 Sequence search designations are as in FIG. 20. 

FIG. 38. Motif search results for Neisseria 
gonorrhoeae f imbrial protein (Pilin) . Sequence search 
designations are as in FIG. 20. 

FIG. 39. Motif search results for 
Hemophilus influenzae fimbria! protein. Sequence 
search designations are as in FIG. 20. 

FIG. 40. Motif search results for 
Staphylococcus aureus, toxic shock syndrome toxin- 1. 
Sequence search designations are as in FIG. 20. 

FIG. 41. Motif search results for 
20 Staphylococcus aureus enterotoxin Type E. Sequence 
search designations are as in FIG. 20. 

FIG. 42. Motif search results for 
Staphylococcus aureus enterotoxin A. Sequence search 
designations are as in FIG. 20, 
25 FIG. 43. Motif search results for 

Escherichia coli, heat labile enterotoxin A. Sequence 
search designations are as in FIG. 20. 

FIG. 44. Motif search results for human c- 
fos proto-oncoprotein. Sequence search designations 
are as in FIG. 20 . 
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FIG. 45. Motif search results for human 
lupus KU autoantigen protein P70. Sequence search 
designations are as in FIG. 20. 

FIG. 46. Motif search results for human 
zinc finger protein 10. Sequence search designations 
5 are as in FIG. 20. 

FIG. 47. Measles virus (MeV) fusion protein 
DP178-like region antiviral and CD data. Antiviral 
symbols, CD symbols, and IC 50 are as in FIG. 27A-D. 
IC S0 values were obtained using purified peptides. 
10 FIG. 48. Simian immunodeficiency virus 

(SIV) TM (fusion) protein DPl78-like region antiviral 
data. Antiviral symbols are as in FIG. 27A-D "NT", 
not tested. 

FIG. 4 9A-C. DP178 -derived peptide antiviral 
data. The peptides listed herein were derived from 
the region surrounding the HIV-1 BRU isolate DP178 
region ( e.g. , gp41 amino acid residues 615-717) . 

In instances where peptides contained DP178 point 
mutations, the mutated amino acid residues are shown 
with a shaded background. In instances in which the 
test peptide has had an amino and/or carboxy-terminal 
group added or removed (apart from the standard amido- 
and acetyl- blocking groups found on such peptides) , 
such modifications are indicated. FIG. 49A: The 
column to the immediate right of the name of the test 

25 peptide indicates the size of the test peptide and 
points out whether the peptide is derived from a one 
amino acid peptide "walk" across the DP178 region. 
The next column to the right indicates whether the 
test peptide contains a point mutation, while the 

3q column to its right indicates whether certain amino 
acid residues have been added to or removed from the 
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DP178-derived amino acid sequence. FIG 49B: The 
column to the immediate right of the test peptide name 
indicates whether the peptide represents a DP178 
truncation, the next column to the right points out 
whether the peptide contains a point mutation, and the 
5 column to its right indicates whether the peptide 
contains amino acids which have been added to or 
removed from the DP178 sequence itself. FIG. 49C: 
The column to the immediate right of the test peptide 
name indicates whether the test peptide contains a 

10 point mutation, while the column to its right 

indicates whether amino acid residues have been added 
to or removed from the DP178 sequence itself. IC 50 is 
as defined in FIG. 27A-D, and IC 50 values were obtained 
using purified peptides except where marked with an 

^ asterisk (*) , in which case the IC 50 was obtained using 
a crude peptide preparation. 

FIG. 50. DP107 and DP107 gp41 region 
truncated peptide antiviral data. IC 50 as defined in 
FIG. 2 7A-D, and IC 50 values were obtained using 
purified peptides except where marked with an asterisk 

20 (*) , in which case the IC 50 was obtained using a crude 
peptide preparation. 

FIG. 51A-B. Epstein-Barr virus Strain B95-8 
BZLFl DP178/DP107 analog region peptide walks and 
electrophoretic mobility shift assay results. The 

25 peptides (T-423 to T-446, FIG. 51A; T-447 to T-461, 
FIG. 51B) represent one amino acid residue "walks" 
through the EBV Zebra protein region from amino acid 
residue 173 to 246. 

The amino acid residue within this region which 
corresponds to the first amino acid residue of each 

•3 U 

peptide is listed to the left of each peptide, while 
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the amino acid residue within this region which 
corresponds to the last amino acid residue of each 
peptide is listed to the right of each peptide. The 
length of each test peptide is listed at the far right 
of each line, under the heading "Res". 
5 "ACT" refers to a test peptide's ability to 

inhibit Zebra binding to its response element. "+" 
refers to a visible, but incomplete, abrogation of the 
response element /Zebra homodimer complex; "+++" refers 
to a complete abrogation of the complex; and "-" 

10 represents a lack of complex disruption. 

FIG. 52A-B. Hepatitis B virus subtype AYW major 
surface antigen precursor S protein DP178/DP107 analog 
region and peptide walks. 52A depicts Domain I (S 
protein amino acid residues 174-22 0) , which contains a 

^ potential DP178/DP107 analog region. In addition, 
peptides are listed which represent one amino acid 
peptide "walks" through domain I. 52B depicts Domain 
II (S protein amino acid residues 233-291) , which 
contains a second potential DP178/DP107 analog region. 
In addition, peptides are listed which represent one 

20 amino acid peptide "walks" through domain II. 

FIG. 53: Cell fusion and competitive inhibition 
data for alanine walk experiments for the DP178-like 
Respiratory Syncytial Virus (RSV) peptide T112. 

FIG. 54: Circular dichroism, cell fusion and 

25 competitive inhibition data for alanine walk 

experiments for the peptide T20, which is also known 
as DP178. 



5. DETAILED DESCRIPTION OF THE INVENTION 
Described herein are peptides which may exhibit 

30 

antifusogenic activity, antiviral capability, and/or 



- 23 - 



WO 01/51673 



PCT/US00/35727 



the ability to modulate intracellular processes 
involving coiled-coil peptide structures. The 
peptides described include, first, DP178 (SEQ ID 
NO:l), a gp41-derived 36 amino acid peptide and 
fragments and analogs of DP178 . 
5 In addition, the peptides of the invention 

described herein include peptides which are DP107 
analogs. DP107 (SEQ ID NO: 25) is a 38 amino acid 
peptide corresponding to residues 558 to 595 of the 
HIV-luu transmembrane (TM) gp41 protein. Such DP107 
10 analogs may exhibit antifusogenic capability, 
antiviral activity or an ability to modulate 
intracellular processes involving coiled-coil 
structures. 

Further, peptides of the invention include DP107 
and DP178 are described herein having amino acid 
sequences recognized by the 107x178x4, ALLM0TI5 * and 
PLZIP search motifs. Such motifs are also discussed. 

Also described here are antifusogenic, antiviral, 
intracellular modulatory, and diagnostic uses of the 
peptides of the invention. Further, procedures are 
20 described for the use of the peptides of the invention 
for the identification of compounds exhibiting 
antifusogenic, antiviral or intracellular modulatory 
activity. 

While not limited to any theory of operation, the 
25 following model is proposed to explain the potent 
anti-HIV activity of DP178, based, in part, on the 
experiments described in the Examples, infra. In the 
HIV protein, gp41, DP178 corresponds to a putative a- 
helix region located in the C-terminal end of the gp41 
3o ectodomain, and appears to associate with a distal 

site on gp41 whose interactive structure is influenced 
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by the leucine zipper motif, a coiled- coil structure, 
referred to as DP107. The association of these two 
domains may reflect a molecular linkage or "molecular 
clasp" intimately involved in the fusion process. It 
is of interest that mutations in the C- terminal a- 
5 helix motif of gp41 ( i.e. . the D178 domain) tend to 
enhance the fusion ability of gp41, whereas mutations 
in the leucine zipper region ( i.e. , the DPI 07 domain) 
decrease or abolish the fusion ability of the viral 
protein. It may be that the leucine zipper motif is 

10 involved in membrane fusion while the C-terminal a- 
helix motif serves as a molecular safety to regulate 
the availability of the leucine zipper during virus - 
induced membrane fusion. 

On the basis of the foregoing, two models are 

^ proposed of gp4l -mediated membrane fusion which are 
schematically shown in FIG. 11A-B. The reason for 
proposing two models is that the temporal nature of 
the interaction between the regions defined by DP107 
and DPI 78 cannot, as yet, be pinpointed. Each model 
envisions two conformations for gp4l - one in a 

20 "native" state as it might be found on a resting 

virion. The other in a "fusogenic" state to reflect 
conformational changes triggered following binding of 
gp!2 0 to CD4 and just prior to fusion with the target 
cell membrane. The strong binding affinity between 

25 gpl2 0 and CD4 may actually represent the trigger for 
the fusion process obviating the need for a pH change 
such as occurs for viruses that fuse within 
intracellular vesicles. The two major features of 
both models are: (1) the leucine zipper sequences 

^ (DP107) in each chain of oligomeric enveloped are held 
apart in the native state and are only allowed access 
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to one another in the fusogenic state so as to form 
the extremely stable coiled-coils, and (2) association 
of the DP178 and DP107 sites as they exist in gp41 
occur either in the native or fusogenic state. FIG. 
11A depicts DP178/DP107 interaction in the native 
5 state as a molecular clasp. On the other hand, if one 
assumes that the most stable form of the enveloped 
occurs in the fusogenic state, the model in FIG. 11B 
can be considered. 

When synthesized as peptides, both DP107 and 
10 DP178 are potent inhibitors of HIV infection and 

fusion, probably by virtue of their ability to form 
complexes with viral gp41 and interfere with its 
fusogenic process; e.g. , during the structural 
transition of the viral protein from the native 
structure to the fusogenic state, the DP178 and DP107 

15 

peptides may gain access to their respective binding 
sites on the viral gp41, and exert a disruptive 
influence. DP107 peptides which demonstrate anti-HIV 
activity are described in Applicants 1 co-pending 
application Serial No. 08/264,531, filed June 23, 
20 1994, which is incorporated by reference herein in its 
entirety. 

As shown in the Examples, infra , a truncated 
recombinant gp41 protein corresponding to the 
ectodomain of gp41 containing both DP107 and DP178 
2 5 domains (excluding the fusion peptide, transmembrane 
region and cytoplasmic domain of gp41) did not inhibit 
HIV-1 induced fusion. However, when a single mutation 
was introduced to disrupt the coiled-coil structure of 
the DP107 domain --a mutation which results in a 
total loss of biological activity of DP107 peptides -- 
the inactive recombinant protein was transformed to an 
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active inhibitor of HIV-l induced fusion. This 
transformation may result from liberation of the 
potent DP178 domain from a molecular clasp with the 
leucine zipper, DP107 domain. 

For clarity of discussion, the invention will be 
described primarily for DP178 peptide inhibitors of 
HIV. However, the principles may be analogously 
applied to other viruses, both enveloped and 
nonenveloped, and to other non-viral organisms. 



15 
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5.1. DP178 AND DP178-LIKE PEPTIDES 
The DP178 peptide {SEQ ID:1) of the invention 
corresponds to amino acid residues 638 to 673 of the 
transmembrane protein gp41 from the HIV-1^ isolate, 
and has the 3 6 amino acid sequence (reading from amino 
5 to carboxy terminus) : 

NH 2 -YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-COOH (SEQ ID:1) 

In addition to the full-length DP178 (SEQ ID:1) 

!0 3 6-mer, the peptides of the invention may include 
truncations of the DP178 (SEQ ID:1) peptide which 
exhibit ant if usogenic activity, antiviral activity 
and/or the ability to modulate intracellular processes 
involving coiled-coil peptide structures. Truncations 

^ of DP178 (SEQ ID:1) peptides may comprise peptides of 
between 3 and 3 6 amino acid residues ( i.e. . peptides 
ranging in size from a tripeptide to a 3 6-mer 
polypeptide) , as shown in Tables I and IA, below. 
Peptide sequences in these tables are listed from 
amino (left) to carboxy (right) terminus. "X" may 

20 represent an amino group (-NH 2 ) and "Z" may represent a 
carboxyl (-COOH) group. Alternatively, "X" may 
represent a hydrophobic group, including but not 
limited to carbobenzyl, dansyl, or T-butoxycarbonyl ; 
an acetyl group; a 9-f luorenylmethoxy-carbonyl (FMOC) 

25 group; or a covalently attached macromolecular group, 
including but not limited to a lipid-fatty acid 
conjugate, polyethylene glycol, carbohydrate or 
peptide group. Further, n z n may represent an amido 
group; a T-butoxycarbonyl group; or a covalently 

^ attached macromolecular group, including but not 

limited to a lipid-fatty acid conjugate, polyethylene 
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glycol, carbohydrate or peptide group. A preferred 
"X" or lf Z n macromolecular group is a peptide group. 



TABLE I 

DP178 (SEP ID:1) CARBOXY TRUNCATIONS 

5 X-YTS-Z 

X-YTSL-Z 

X-YTSLI-Z 

X-YTSLIH-Z 

X-YTSLIHS-Z 

X-YTSLIHSL-Z 

X-YTSLIHSLI-Z 

X-YTSLIHSLIE-Z 
10 X-YTSLIHSLIEE-Z 

X-YTSLIHSLIEES-Z 

X-YTSLIHSLIEESQ-Z 

X-YTSLIHSLIEESQN-Z 

X-YTSLIHSLIEESQNQ-Z 

X-YTSLIHSLIEESQNQQ-Z 

X - YTSLIHSLIEESQNQQE - Z 

X-YTSLIHSLIEESQNQQEK-Z 
15 X-YTSLIHSLIEESQNQQEKN-Z 

X-YTSLIHSLIEESQNQQEKNE-Z 

X-YTSLIHSLIEESQNQQEKNEQ-Z 

X-YTSLIHSLIEESQNQQEKNEQE-Z 

X-YTSLIHSLIEESQNQQEKNEQEL-Z 

X-YTSLIHSLIEESQNQQEKNEQELL-Z 

X - YTS LIHSLIEES QNQQE KNEQELLE - Z 

X-YTSLIHSLIEESQNQQEKNEQELLEL-Z 
2 0 X- YTSLIHSLIEESQNQQEKNEQELLELD- Z 

X - YTSL IHSLIEESQNQQEKNEQELLELDK- Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKW-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWA-Z 

X - YTSLIHSLIEESQNQQEKNEQELLELDKWAS - Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASL-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-Z 
2 5 X- YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW- Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 



The one letter amino acid code is used. 



30 
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TABLE IA 

DP178 (SEP ID:1) AMINO TRUNCATIONS 

X-NWF-Z 
X-WNWF-Z 
X-LWNWF-Z 
X-SLWNWF-Z k 

5 X-ASLWNWF-Z 

X-WASLWNWF-Z 
X-KWASLWNWF-Z 
X-DKWASLWNWF-Z 
X - LDKWASLWNWF - Z 
X - ELDKWAS LWNWF - Z 
X - LELDKWASLWNWF - Z 
X - LLELDKWASLWNWF - Z 

10 X - ELLELDKWAS LWNWF - Z 

X - QELLELDKWAS LWNWF - Z 
X - EQELLELDKWAS LWNWF - Z 
X -NEQELLELDKWASLWNWF - Z 
X- KNEQELLELDKWAS LWNWF - Z 
X - EKNEQELLELDKWASLWNWF Z 
X-QEKNEQELLELDKWASLWNWF - Z 
X - QQEKNEQELLELDKWASLWNWF - Z 
15 X -NQQEKNEQELLELDKWASLWNWF- Z 

X -QNQQEKNEQELLELDKWASLWNWF - Z 
X - S QNQQE KNEQELLE LDKWAS LWNWF - Z 
X -ESQNQQEKNEQELLELDKWASLWNWF - Z 
X - EE S QNQQEKNEQELLELDKWAS LWNWF - Z 
X- I E E S QNQQEKNEQELLELDKWAS LWNWF - Z 
X -LIEESQNQQEKNEQELLELDKWAS LWNWF - Z 
X-SLIEESQNQQEKNEQELLELDKWASLWNWF-Z 
2 0 X-HSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X- IHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 
X -LIHSLIEESQNQQEKNEQELLELDKWASLWNWF - Z 
X-SLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 
X-TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 



The one letter amino acid code is used. 

25 
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The peptides of the invention also include DP178- 
like peptides. "DP178-like" , as used herein, refers, 
first, to DP178 and DP178 truncations which contain 
one or more amino acid substitutions, insertions 
and/or deletions. Second, "DP-178-like M refers to 
5 peptide sequences identified or recognized by the 

ALLMOTI5, 107x17 8x4 and PLZIP search motifs described 
herein, having structural and/or amino acid motif 
similarity to DP178. The DP178-like peptides of the 
invention may exhibit antifusogenic or antiviral 
10 activity, or may exhibit the ability to modulate 
intracellular processes involving coiled^coil 
peptides. Further, such DP178-like peptides may 
possess additional advantageous features, such as, for 
example, increased bioavailability, and/or stability, 
or reduced host immune recognition. 

15 

HIV-1 and HIV-2 enveloped proteins are 
structurally distinct, but there exists a striking 
amino acid conservation within the DP178 -corresponding 
regions of HIV-1 and HIV-2. The amino acid 
conservation is of a periodic nature, suggesting some 

20 conservation of structure and/or function. Therefore, 
one possible class of amino acid substitutions would 
include those amino acid changes which are predicted 
to stabilize the structure of the DP178 peptides of 
the invention. Utilizing the DP178 and DP178 analog 

25 sequences described herein, the skilled artisan can 

readily compile DP178 consensus sequences and 

ascertain from these, conserved amino acid residues 

which would represent preferred amino acid 

substitutions . 

The amino acid substitutions may be of a 
30 J 

conserved or non- conserved nature. Conserved amino 

acid substitutions consist of replacing one or more 
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preferred amino terminal or carboxy terminal amino 
acid insertion would contain gp41 amino acid sequences 
found immediately amino to or carboxy to the DP178 
region of the gp41 protein. 

Deletions of DP178 (SEQ ID:1) or DP178 
5 truncations are also within the scope of the 

invention. Such deletions consist of the removal of 
one or more amino acids from the DP178 or DP178-like 
peptide sequence, with the lower limit length of the 
resulting peptide sequence being 4 to 6 amino acids. 

10 Such deletions may involve a single contiguous or 
greater than one discrete portion of the peptide 
sequences. One or more such deletions may be 
introduced into DP178 (SEQ.ID:!) or DP178 truncations, 
as long as such deletions result in peptides which may 

^ still be recognized by the 107x178x4, ALLMOTI5 or 
PLZIP search motifs described- herein, or may, 
alternatively, exhibit antifusogenic or antiviral 
activity, or exhibit the ability to modulate 
intracellular processes involving coiled-coil peptide 
structures . 

20 DP178 analogs are further described, below, in 

Section 5.3. 

5.2. PP107 AND DP107-LIKE PEPTIDES 
Further, the peptides of the invention include 
25 peptides having amino acid sequences corresponding to 
DP107 analogs. DP107 is a 38 amino acid peptide which 
exhibits potent antiviral activity, and corresponds to 
residues 558 to 595 of HIV-1^ transmembrane (TM) gp41 
protein, as shown here: 

30 

NH 2 -NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ- COOH 

(SEQ ID:25) 
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In addition to the full-length DP107 (SEQ ID:25) 
38-mer, the peptides of the invention may include 
truncations of the DP107 (SEQ ID: 25) peptide which 
exhibit antifusogenic activity, antiviral activity 
and/or the ability to modulate intracellular processes • 
5 involving coiled-coil peptide structures. Truncations 
of DP107 {SEQ ID: 25) peptides may comprise peptides of 
between 3 and 3 8 amino acid residues ( i >e . . peptides 
ranging in size from a tripeptide to a 3 8-mer 
polypeptide) , as shown in Tables II and IIA, below. 

10 Peptide sequences in these tables are listed from 
amino (left) to carboxy (right) terminus ; "X" may 
represent an amino group ( -NH 2 ) and 11 Z" may represent a 
carboxyl (-COOH) group. Alternatively, "X" may 
represent a hydrophobic group, including but not 

^ limited to carbobenzyl, dansyl, or T-butoxycarbonyl; 
an acetyl group; a 9-f luorenylmethoxy-carbonyl (FMOC) 
group; or a covalently attached macromolecular group, 
including but not limited to a lipid-fatty acid 
conjugate, polyethylene glycol, carbohydrate or 
peptide group. Further, "Z" may represent an amido 

20 group; a T-butoxycarbonyl group; or a covalently 
attached macromolecular group, including but not 
limited to a lipid-fatty acid conjugate, polyethylene 
glycol, carbohydrate or peptide group. A preferred 
"X" or "Z" macromolecular group is a peptide group. 



30 
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TABLE II 

DP107 (SEP ID: 25) CARBOXY TRUNCATIONS 



X-NNL-Z 

X-NNLL-Z 

X-NNLLR-Z 
5 X-NNLLRA-Z 

X-NNLLRAI-Z 

X - NNLLRAIE - Z 

X-NNLLRAIEA-Z 

X -NNLLRAI E AQ - Z 

X - NNLLRAI E AQQ - Z 

X- NNLLRAI EAQQH- Z 

X -NNLLRAIEAQQHL - Z 
10 X -NNLLRAIEAQQHLL - Z 

X - NNLLRAI E AQQHLLQ - Z 

X-NNLLRAIEAQQHLLQL - Z 

X - NNLLRAI EAQQHLLQLT - Z 

X - NNLLRAI E AQQHLLQLT V - Z 

X - NNLLRAI EAQQHLLQLTVW - Z 

X - NNLLRAI EAQQHLLQLT VWQ - Z 

X - NNLLRAI EAQQHLLQLT VWQ I - Z 
!5 X - NNLLRAI E AQQHLLQLTVWQ I K - Z 

X-NNLLRAIEAQQHLLQLTVWQIKQ- Z 

X - NNLLRAI EAQQHLLQLT VWQ I KQL - Z 

X - NNLLRAI EAQQHLLQLTVWQ I KQLQ - Z 

X-NNLLRAIEAQQHLLQLTVWQIKQLQA- Z 

X-NNLLRAIEAQQHLLQLTVWQIKQLQAR- Z 

X - NNLLRAI E AQQHLLQLTVWQ I KQLQAR I - Z 

X - NNLLRAI E AQQHLLQLTVWQ I KQLQAR I L - Z 
2 0 X-NNLLRAIEAQQHLLQLTVWQIKQLQARILA- Z 

X - NNLLRAI E AQQHLLQLTVWQ I KQLQAR I LAV - Z 

X - NNLLRAI EAQQHLLQLTVW Q I KQLQAR I LAVE - Z 

X - NNLLRAI EAQQHLLQLT VWQ I KQLQAR I LAVER - Z 

X - NNLLRAI E AQQHLLQL TVWQ I KQLQAR I LAVER Y - Z 

X - NNLLRAI EAQQHLLQLT VWQ I KQLQ ARI LAVERYL - Z 

X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLK-Z 

X - NNLLRAI E AQQHLLQ LT VWQ I KQLQAR I L AVERYLKD - Z 
25 X - NNLLRAI E AQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 



The one letter amino acid code is used. 
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TABLE IIA 
DPI 7 8 (SEP ID: 25) AMINO TRUNCATIONS 



X-KDQ- Z 
X-LKDQ- Z 
X-YLKDQ- Z 

5 ' X-RYLKDQ- Z 

X-ERYLKDQ- Z 
X-VERYLKDQ- Z 
X-AVERYLKDQ- Z 
X - LAVERYLKDQ - Z 
X - I LAVERYLKDQ - Z 
X - RILAVERYLKDQ - Z 
X - ARI LAVERYLKDQ - Z 

10 X-QARILAVERYLKDQ- Z 

X-LQARILAVERYLKDQ- Z 
X - QLQARILAVERYLKDQ - Z 
X - KQLQARI LAVERYLKDQ - Z 
X- IKQLQARILAVERYLKDQ- Z 
X - Q I KQLQARILAVER YLKDQ - .Z 
X-WQ IKQLQARILAVERYLKDQ- Z 
X - VWQ I KQLQARI LAVERYLKDQ - Z 
15 X - TVWQ I KQLQARI LAVERYLKDQ - Z 

X - LTVWQ I KQLQAR I LAVERYLKDQ - Z 
X - QLTVWQ I KQLQARI LAVERYLKDQ - Z 
X-LQLTVWQIKQLQARILAVERYLKDQ- Z 
X - LLQLTVWQ IKQLQARI LAVERYLKDQ - Z 
X - HLLQLTVWQ I KQLQARI LAVERYLKDQ - Z 
X - QHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 
X - QQHLLQLTVWQ I KQLQARI LAVERYLKDQ - Z 
20 X-AQQHLLQLTVWQIKQLQARILAVERYLKDQ- Z 

X - EAQQHLLQLTVWQI KQLQARILAVERYLKDQ - Z 
X- IEAQQHLLQLTVWQIKQLQARILAVERYLKDQ- Z 
X - AI E AQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 
X - RAI E AQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 
X - LRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ - Z 
X - LLRA I E AQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 
X - NLLRAI E AQQHLLQLTVWQ I KQ LQ AR I LAVERYLKDQ - Z 
25 X - NNLLRAI E AQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 



The one letter amino acid code is used. 
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The peptides of the invention also include DP107- 
like peptides. "DPl07-like n , as used herein, refers, 
first, to DP107 and DP107 truncations which contain 
one or more amino acid substitutions, insertions 
and/or deletions. Second, !, DP-l07-like" refers to 
5 peptide sequences identified or recognized by the 

ALLMOTI5, 107x178x4 and PLZIP search motifs described 
herein, having structural and/or amino acid motif 
similarity to DP107. The DP107-like peptides of the 
invention may exhibit antifusogenic or antiviral 
10 activity, or may exhibit the ability to modulate 
intracellular processes involving coiled-coil 
peptides. Further, such DP107-like peptides may 
possess additional advantageous features, such as, for 
example, increased bioavailability, and/or stability, 
or reduced host immune recognition. 

15 

HIV-1 and HIV-2 enveloped proteins are 
structurally distinct, but there exists a striking 
amino acid conservation within the DP10 7 -corresponding 
regions of HIV-1 and HIV-2. The amino acid 
conservation is of a periodic nature, suggesting some 

20 conservation of structure and/or function. Therefore, 
one possible class of amino acid substitutions would 
include those amino acid changes which are predicted 
to stabilize the structure of the DP107 peptides of 
the invention. Utilizing the DP107 and DP107 analog 

25 sequences described herein, the skilled artisan can 
readily compile DP107 consensus sequences and 
ascertain from these, conserved amino acid residues 
which would represent preferred amino acid 
substitutions. 

The amino acid substitutions may be of a 

30 

conserved or non-conserved nature. Conserved amino 
acid substitutions consist of replacing one or more 
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amino acids of the DP107 (SEQ ID: 25) peptide sequence 
with amino acids of similar charge, size, and/or 
hydrophobicity characteristics, such as, for example, 
a glutamic acid (E) to aspartic acid (D) amino acid 
substitution. Non-conserved substitutions consist of . 
5 replacing one or more amino acids of the DP107 (SEQ 
ID: 25) peptide sequence with amino acids possessing 
dissimilar charge, size, and/or hydrophobicity 
characteristics, such as, for example, a glutamic acid 
(E) to valine (V) substitution. 

10 Amino acid insertions may consist of single amino 

acid residues or stretches of residues. • The 
insertions may be made at the carboxy or amino 
terminal end of the DP1G7 or DP107 truncated peptides, 
as well as at a position internal to the peptide. 
Such insertions will generally range from 2 to 15 
amino acids in length. It is contemplated that 
insertions made at either the carboxy or amino 
terminus of the peptide of interest may be of a 
broader size range, with about 2 to about 50 amino 
acids being preferred. One or more such insertions 

20 may be introduced into DP107 (SEQ. ID: 25) or DP107 
truncations, as long as such insertions result in 
peptides which may still be recognized by the 
107x178x4, ALLMOTI5 or PLZIP search motifs described 
herein, or may, alternatively, exhibit antif usogenic 

25 °r antiviral activity, or exhibit the ability to 

modulate intracellular processes involving coiled-coil 
peptide structures. 

Preferred amino or carboxy terminal insertions 
are peptides ranging from about 2 to about 50 amino 

3 0 acid residues in length, corresponding to gp41 protein 
regions either amino to or carboxy to the actual DP107 
gp41 amino acid sequence, respectively. Thus, a 
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preferred amino terminal or carboxy terminal amino 
acid insertion would contain gp4l amino acid sequences 
found immediately amino to or carboxy to the DPI 07 
region of the gp4l protein. 

Deletions of DP107 (SEQ ID:25) or DP178 
5 truncations are also within the scope of the 

invention. Such deletions consist of the removal of 
one or more amino acids from the DP107 or DP107-like 
peptide sequence, with the lower limit length of the 
resulting peptide sequence being 4 to 6 amino acids. 

10 Such deletions may involve a single contiguous or 
greater than one discrete portion of the. peptide 
sequences. One or more such deletions may be 
introduced into DP107 (SEQ. ID: 25) or DP107 
truncations, as long as such deletions result in 
peptides which may still be recognized by the 
107x178x4, ALLMOTI5 or PLZIP search motifs described 
herein, or may, alternatively, exhibit antifusogenic 
or antiviral activity, or exhibit the ability to 
modulate intracellular processes involving coiled-coil 
peptide structures. 

20 DP107 and DP107 truncations are more fully 

described in Applicants' co-pending U.S. Patent 
Application Ser. No. 08/374,666, filed January 27, 
1995, and which is incorporated herein by reference in 
its entirety. DP107 analogs are further described, 

25 below, in Section 5.3. 

5.3 . DP 107 and DPI 7 8 ANALOGS 

Peptides corresponding to analogs of the DP178, 
DP178 truncations, DP107 and DP107 truncation 
3q sequences of the invention, described, above, in 
Sections 5.1 and 5.2 may be found in other viruses, 
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including, for example, non-HIV-l^ enveloped viruses, 
non-enveloped viruses and other non-viral organisms. 

The term "analog" , as used herein, refers to a 
peptide which is recognized or identified via the 
107x178x4, ALLMOTI5 and/or PLZIP search strategies 
5 discussed below. Further, such peptides may exhibit 
antifusogenic capability, antiviral activity, or the 
ability to modulate intracellular processes involving 
coiled-coil structures. 

Such DP178 and DP107 analogs may, for example, 

10 correspond to peptide sequences present in TM proteins 
of enveloped viruses and may, additionally correspond 
to peptide sequences present in non enveloped and non- 
viral organisms. Such peptides may exhibit 
antifusogenic activity, antiviral activity, most 
particularly antiviral activity which is specific to 
the virus in which their native sequences are faund, 
or may exhibit an ability to modulate intracellular 
processes involving coiled-coil peptide structures. 

DP178 analogs are peptides whose amino acid 
sequences are comprised of the amino acid sequences of 

20 peptide regions of, for example, other ( i.e. , other 
than HIV-l^j) viruses that correspond to the gp41 
peptide region from which DP178 (SEQ ID:1) was 
derived. Such viruses may include, but are not 
limited to, other HIV-1 isolates and HIV-2 isolates. 

25 DP178 analogs derived from the corresponding gp4l 
peptide region of other ( i.e. , non HIV-1^) HIV-1 
isolates may include, for example, peptide sequences 
as shown below. 

3 NH 2 - YTNTI YTLLEESQNQQEKNEQELLELDKWASLWNWF - COOH 

(DP- 185; SEQ ID: 3) ; 
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NH 2 -YTGIIYNLLEESQNQQEKNEQELLELDKWANLWNWF-COOI{SEQ ID:4) ; 

NH 2 -YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF-COOH 

(SEQ ID:5) . 

5 SEQ ID:3 (DP-185) , SEQ ID:4, and SEQ ID:5 are derived 
from HIV-1 SF2/ HIV-1 RF , and HIV-l^ isolates, 
respectively. Underlined amino acid residues refer to 
those residues that differ from the corresponding 
position in the DP178 (SEQ ID:1) peptide. One such 
!0 DP178 analog, DP-185 (SEQ ID:3), is described in the 
Example presented in Section 6, below, where it is 
demonstrated that DP-185 (SEQ ID: 3) exhibits antiviral 
activity. The DP178 analogs of the invention may also 
include truncations, as described above. Further, the 
analogs of the invention modifications such those 

15 

described for DP178 analogs in Section 5.1., above. 
It is preferred that the DP178 analogs of the 
invention represent peptides whose amino acid 
sequences correspond to the DP178 region of the gp41 
protein, it is also contemplated that the peptides of 

20 the invention may, additionally, include amino 

sequences, ranging from about 2 to about 5 0 amino acid 
residues in length, corresponding to gp41 protein 
regions either amino to or carboxy to the actual DP178 
amino acid sequence. 

25 Striking similarities, as shown in FIG. 1, exist 

within the regions of HIV-1 and HIV- 2 isolates which 
correspond to the DP178 sequence. A DP178 analog 
derived from the HIV-2 KIKZ isolate has the 36 amino acid 
sequence (reading from amino to carboxy terminus) : 

30 

NH 2 -LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-COOH (SEQ ID: 7) 
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Table III and Table IV show some possible truncations 
of the HIV-2 NIHZ DP178 analog, which may comprise 
peptides of between 3 and 36 amino acid residues 
( i.e. , peptides ranging in size from a tripeptide to a 
36-mer polypeptide) . Peptide sequences in these ' 
5 tables are listed from amino (left) to carboxy (right) 
terminus. "X fl may represent an amino group ( -NH 2 ) and 
"Z" may represent a carboxyl (-COOH) group. 
Alternatively, "X" may represent a hydrophobic group, 
including but not limited to carbobenzyl, dansyl, or 
T-butoxycarbonyl; an acetyl group; a 9- 
f luorenylmethoxy-carbonyl (FMOC) group; or a 
covalently attached macromolecular group, including 
but not limited to a lipid- fatty acid conjugate, 
polyethylene glycol, carbohydrate or peptide group. 
^ Further, "Z" may represent an amido group; a T- 
butoxycarbonyl group; or a covalently attached . 
macromolecular group, including but not limited to a 
lipid- fatty acid conjugate, polyethylene glycol, 
carbohydrate or peptide group. A preferred "X" or tt Z tt 
macromolecular group is a peptide group. 



25 



30 
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TABLE III 

HIV-2 NIH2 DP178 analog carboxy truncations. 

X-LEA-Z 

X-LEAN-Z 

X- LEANI -Z 

X-LEANIS-Z 
5 X-LEANISQ-Z 

X-LEANISQS-Z 

X-LEANISQSL-Z 

X - LEANI SQSLE-Z 

X-LEANISQSLEQ-Z 

X-LEANISQSLEQA-Z 

X-LEANISQSLEQAQ-Z 

X-LEANISQSLEQAQI-Z 
10 X- LEANI SQSLEQAQIQ-Z 

X-LEANISQSLEQAQIQQ-Z 

X - LEANISQSLEQAQIQQE - Z 

X- LEANI SQSLEQAQIQQEK-Z 

X- LEANI SQSLEQAQIQQEKN- Z 

X-LEANISQSLEQAQIQQEKNM-Z 

X - LEAN I S QSLEQAQ I QQEKNMY - Z 

X-LEANISQSLEQAQIQQEKNMYE-Z 
15 X - LEANI SQSLEQAQIQQEKNMYEL-Z 

X-LEANISQSLEQAQIQQEKNMYELQ-Z 

X - LEAN I S Q S LEQ AQ I QQE KNM YELQK - Z 

X - LEANI SQSLEQAQIQQEKNMYELQKL - Z 

X-LEANISQSLEQAQIQQEKNMYELQKLN-Z 

X-LEANISQSLEQAQIQQEKNMYELQKLNS-Z 

X-LEANISQSLEQAQIQQEKNMYELQKLNSW-Z 

X - LEAN I S Q S LEQ AQ I QQEKNM YE LQKLNS WD - Z 
2 0 X-LEANISQSLEQAQIQQEKNMYELQKLNSWDV-Z 

X - LEANI SQSLEQAQIQQEKNMYELQKLNSWDVF - Z 

X - LEANI SQSLEQAQIQQEKNMYELQKLNS WDVFT - Z 

X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTN-Z 

X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNW-Z 

X - LEANI SQSLEQAQIQQEKNMYELQKLNS WDVFTNWL - Z 



The one letter amino acid code is used. 

25 
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TABLE IV 

HIV-2 NIH2 DP178 analog amino truncations. 

X-NWL-Z 
X-TNWL-Z 
X-FTNWL-Z 
X -VFTNWL- Z 
X-DVFTNWL-Z* 
X-WDVFTNWL-Z 
X-SWDVFTNWL-Z 
X -NS WDVFTNWL -Z 
X-LNS WDVFTNWL -Z 
X - KLNS WDVFTNWL - Z 
X - QKLNSWDVFTNWL - Z 
X - LQKLNS WDVFTNWL - Z 
X - ELQKLNS WDVFTNWL - Z 
X - YELQKLNS WDVFTNWL - Z 
X - MYELQKLNS WDVFTNWL - Z 
X - NMYELQKLNS WD VFTNWL - Z 
X - KNMYELQKLNS WDVFTNWL - Z 
X - EKNMYELQKLNS WDVFTNWL - Z 
X - QEKNMYELQKLNSWDVFTNWL - Z 
X - QQEKNMYELQKLNSWD VFTNWL - Z 
X - IQQEKNMYELQKLNSWD VFTNWL - Z 
X - Q I QQE KNMYELQKLNS WD VFTNWL - Z 
X -AQIQQEKNMYELQKLNSWDVFTNWL-Z 
X - QAQ IQQEKNMYELQKLNS WDVFTNWL - Z 
X - EQAQ I QQEKNMYELQKLNS WDVFTNWL - Z 
X - LEQAQ IQQEKNMYELQKLNS WDVFTNWL- Z 
X - S LEQAQ I QQE KNMYELQKLNS WD VFTNWL - Z 
X - QSLEQAQIQQEKNMYELQKLNSWDVFTNWL - Z 
X-SQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X -ISQSLEQAQIQQEKNMYELQKLNSWD VFTNWL- Z 
X -NI SQSLEQAQ I QQE KNMYELQKLNS WDVFTNWL - Z 
X-ANI SQSLEQAQIQQEKNMYELQKLNS WDVFTNWL - Z 
X - E ANI SQS LEQAQ I QQEKNMYELQKLNS WD VFTNWL - Z 
X - LEANI SQSLEQAQIQQEKNMYELQKLNS WDVFTNWL - Z 



The one letter amino acid code is used. 

25 
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DP178 and DP107 analogs are recognized or 
identified, * for example, by utilizing one or more of 
the 107x178x4, ALLMOTI5 or PLZIP computer-assisted 
search strategies described and demonstrated, below, 
in the Examples presented in Sections 9 through 16 and 
5 19 through 25. The search strategy identifies 

additional peptide regions which are predicted to have 
structural and/or amino acid sequence features similar 
to those of DP107 and/or DP178. 

The search strategies are described fully, below, 

10 in the Example presented in Section 9 . While this 

search strategy is based, in part, on a primary amino 
acid motif deduced from DP107 and DP178, it is not 
based solely on searching for primary amino acid 
sequence homologies, as such protein sequence 

^ homologies exist within, but not between major groups 
of viruses. For example, primary amino acid sequence 
homology is high within the TM protein of different 
strains of HIV-1 or within the TM protein of different 
isolates of simian immunodeficiency virus (SIV) . 
Primary amino acid sequence homology between HIV-1 and 

20 SIV, however, is low enough so as not to be useful. 

It is not possible, therefore, to find peptide regions 
similar to DP107 or DP178 within other viruses, or 
within non-viral organisms, whether structurally, or 
otherwise, based on primary sequence homology, alone. 

25 Further, while it would be potentially useful to 

identify primary sequence arrangements of amino acids 
based on, for example, the physical chemical 
characteristics of different classes of amino acids 
rather than based on the specific amino acids 
themselves, such search strategies have, until now, 

3 0 

proven inadequate. For example, a computer algorithm 
designed by Lupas et al. to identify coiled-coil 
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propensities of regions within proteins (Lupas, A., et 
al., 1991 Science 252:1162-1164) is inadequate for 
identifying protein regions analogous to DP107 or 
DP178. 

Specifically, analysis of HIV-1 gp!60 (containing 
5 both gpl2 0 and gp41) using the Lupas algorithm does 
not identify the coiled-coil region within DP107. It 
does, however, identify a region within DP178 
beginning eight amino acids N-terminal to the start of 
DP178 and ending eight amino acids from the C- 

10 terminus. The DP107 peptide has been shown 

experimentally to form a stable coiled coil. A search 
based on the Lupas search algorithm, therefore, would 
not have identified the DP107 coiled-coil region. 
Conversely, the Lupas algorithm identified the DP178 
region as a potential coiled-coil motif. However, the 
peptide derived from the DP178 region failed to- form a 
coiled coil in solution. 

A possible explanation for the inability of the 
Lupas search algorithm to accurately identify coiled- 
coil sequences within the HIV-1 TM, is that the Lupas 

20 algorithm is based on the structure of coiled coils 
from proteins that are not structurally or 
functionally similar to the TM proteins of viruses, 
antiviral peptides ( e.g. DP107 and DP178) of which are 
an object of this invention. 

25 The computer search strategy of the invention, as 

demonstrated in the Examples presented below, in 
Sections 9 through 16 and 19 through 25, successfully 
identifies regions of proteins similar to DP107 or 
DP178. This search strategy was designed to be used 

30 with a COTnmer cially-available sequence database 
package, preferably PC/Gene. 
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A series of search motifs, the 107x178x4, 
ALLMOTI5 and PLZIP motifs, were designed and 
engineered to range in stringency from strict to 
broad, as discussed in this Section and in Section 9, 
with 107x178x4 being preferred. The sequences 
5 identified via such search motifs, such as those 
listed in Tables V-XIV, below, potentially exhibit 
antifusogenic, such as antiviral, activity, may 
additionally be useful in the identification of 
antifusogenic, such as antiviral, compounds, and are 

10 intended to be within the scope of the invention. 

Coiled-coiled sequences are thought to consist of 
heptad amino acid repeats. For ease of description, 
the amino acid positions within the heptad repeats are 
sometimes referred to as A through G, with the first 

^ position being A, the second B, etc. The motifs used 
to identify DP107-like and DP178-like sequences herein 
are designed to specifically search for and identify 
such heptad repeats. In the descriptions of each of 
the motifs described, below, amino acids enclosed by 
brackets , i.e., [] , designate the only amino acid 

20 residues that are acceptable at the given position, 
while amino acids enclosed by braces, i.e., {}, 
designate the only amino acids which are unacceptable 
at the given heptad position. When a set of bracketed 
or braced amino acids is followed by a number in 

25 parentheses i.e., () , it refers to the number of 
subsequent amino acid positions for which the 
designated set of amino acids hold, e.g, a (2) means 
"for the next two heptad amino acid positions". 

30 
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The ALLMOTI5 is written as follows: 

CDGHP}-{CFP} (2) -{CDGHP}-{CFP} (3)- 

CDGHP}-{CFP} (2) -{CDGHP}-{CFP} (3) - 

CDGHP}-{CFP} (2) -{CDGHP}-{CFP} (3) - 
{CDGHP}-{CFP} (2) -{CDGHP}-{CFP} (3) - 
{CDGHP}-{CFP} (2) -{CDGHP}-{CFP} (3) - 

5 Translating this motif, it would read: "at the 

first (A) position of the heptad, any amino acid 

residue except C, D, G, H, or P is acceptable, at the 

next two (B,C) amino acid positions, any amino acid 

residue except C, F, or P is acceptable, at the fourth 

heptad position (D) , any amino acid residue except C, 

D, G, H, or P is acceptable, at the next three (E, F, 

G) amino acid positions, any amino acid residue except 

C, F, or P is acceptable. This motif is designed to 

search for five consecutive heptad repeats (thus the* " 

repeat of the first line five times) , meaning that it 

15 searches for 35-mer sized peptides. It may also be 

designed to search for 2 8-mers, by only repeating the 

initial motif four times. With respect to the 

ALLMOTI5 motif, a 35-mer search is preferred. Those 

viral (non-bacteriophage) sequences identified via 

20 such an ALLMOTI5 motif are listed in Table V in U.S. 

Patent Application No. 08/470,896 filed on June 6, 

1995 which is incorporated herein by reference in its 

entirety. These viral sequences potentially exhibit 

antiviral activity, may be useful in the the 

identification of antiviral compounds, and are 

intended to be within the scope of the invention. In 

those instances wherein a single gene exhibits greater 

than one sequence recognized by the ALLMOTI5 search 

motif, the amino a cid residue numbers of these 

sequences are listed under "Area 2", Area 3", etc. 

30 This convention is used for each of the Tables listed, 

below, at the end of this Section. 
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The 107x178x4 motif is written as follows: 

[EFIKLNQSTVWY] -{CFMP} (2)- [EFIKLNQSTVWY] -{CFMP} (3)- 
[EFIKLNQSTVWY] -{CFMP} (2)- [EFIKLNQSTVWY] -{CFMP} (3) - 
[EFIKLNQSTVWY] -{CFMP} (2) - [EFIKLNQSTVWY] -{ CFMP } (3) - 
[EFIKLNQSTVWY] -{CFMP} (2) - [EFIKLNQSTVWY] -{CFMP} (3) - 

Translating this motif, it would read: "at the 

5 first (A) position of the heptad, only amino acid 

residue E, F, I, K, L, N, Q, S, T, V, W, or Y is 

acceptable, at the next two (B,C) amino acid 

positions, any amino acid residue except C, F, M or P 

is acceptable, at the fourth position (D) , only amino 

10 acid residue E, F, I, K, L, N, Q, S, T, V, W, or Y is 

acceptable, at the next three (E, F, G) amino acid 

positions, any amino acid residue except C, F, M or P 

is acceptable. This motif is designed to search for 

four consecutive heptad repeats (thus the repeat of 

the first line four times) , meaning that it searches 

15 

for 28-mer sized peptides. It may also be designed to 
search for 35-mers, by repeating the initial motif 
five times. With respect to the 107x178x4 motif, a 
2 8-mer search is preferred. 

Those viral (non-bacteriophage) sequences 

20 identified via such a 107x178x4 motif are listed in 
Table VI in U.S. Patent Application No. 08/470,896 
filed on June 6, 1995, which is incorporated herein, 
by reference, in its entirety. Those viral (non- 
bacteriophage) sequences listed in Table VII of U.S. 

25 Patent Application No. 08/470,896 (incorporated herein 
by reference in its entirety) are particularly 
preferred. 

The 107x178x4 search motif was also utilized to 
identify non-viral procaryotic protein sequences, as 
^ listed in Table VIII in U.S. Patent Application No. 
08/470,896 filed on June 6, 1995, which is 
incorporated herein, by reference, in its entirety. 
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Further, this search motif was used to reveal a number 
of human proteins. The results of this human protein 
107x178x4 search is listed in Table IX in U.S. Patent 
Application No. 08/470,896 filed on June 6, 1995, 
which is incorporated herein, by reference, in its 
5 entirety. The sequences listed in Tables VIII and IX, 
therefore, reveal peptides which may be useful as 
antifusogenic compounds or in the identification of 
antifusogenic compounds, and are intended to be within 
the scope of the invention. 

10 The PLZIP series of motifs are as listed in FIG. 

19. These motifs are designed to identify leucine 
zipper coiled-coil like heptads wherein at least one 
proline residue is present at some predefined distance 
N-terminal to the repeat. These PLZIP motifs find 

^ regions of proteins with similarities to HIV-1 DP178 
generally located just N-terminal to the transmembrane 
anchor. These motifs may be translated according to 
the same convention described above. Each line 
depicted in FIG. 19 represents a single, complete 
search motif. "X" in these motifs refers to any amino 

20 acid residue. In instances wherein a motif contains 
two numbers within parentheses, this refers to a 
variable number of amino acid residues. For example, 
X (1,12) is translated to "the next one to twelve 
amino acid residues, inclusive, may be any amino 

25 acid" . 

Tables X through XIV in U.S. Patent Application 
No. 08/470,896 filed on June 6, 1995 (which is 
incorporated herein, by reference, in its entirety) , 
list sequences identified via searches conducted with 
3Q such PLZIP motifs. Specifically, Table X lists viral 
sequences identified via PCTLZIP, P1CTLZIP and 
P2CTLZIP search motifs, Table XI lists viral sequences 
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identified via P3CTLZIP, P4CTLZIP, P5CTLZIP and 
P6CTLZIP search motifs, Table XII Ists viral sequences 
identified via P7CTLZIP, P8CTLZIP and P9CTLZIP search 
motifs, Table XIII lists viral sequences identified 
via P12LZIPC searches and Table XIV lists viral 
5 sequences identified via P23TLZIPC search motifs The 
viral sequences listed in these tables represent 
peptides which potentially exhibit antiviral activity, 
may be useful in the identification of antiviral 
compounds, and are intended to be within the scope of 

10 the invention. 

The Examples presented in Sections 17, 18, 26 and 
27 below, demonstrate that viral sequences identified 
via the motif searches described herein identify 
substantial antiviral characteristics. Specifically, 

^ the Example presented in Section 17 describes peptides 
with ant i- respiratory syncytial virus activity,* the 
Example presented in Section 18 describes peptides 
with anti-parainf luenza virus activity, the Example 
presented in Section 26 describes peptides with anti- 
measles virus activity and the Example presented in 

20 Section 27 describes peptides with anti-simian 
immunodeficiency virus activity. 

The DP107 and DP178 analogs may, further, contain 
any of the additional groups described for DP178, 
above, in Section 5.1. For example, these peptides 

25 may include any of the additional amino -terminal 

groups as described above for "X n groups, and may also 
include any of the carboxy- terminal groups as 
described, above, for 11 Z" groups. 

Additionally, truncations of the identified DP107 

3Q and DP17 8 peptides are among the peptides of the 

invention. Further, such DPI 07 and DPI 7 8 analogs and 
DP107/DP178 analog truncations may exhibit one or more 
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TABLE V 

T 

No. Sequence 

1 GIKQLQARILAVERYLKDQ 

2 NNLLRAIEAQQHLLQLTVW 

3 NEQELLELDKWASLWNWF 

4 YTSL IHSL I EES QNQQEK 

5 AC - VWGIKQLQARILAVERYLKDQQLLGIWG -NH2 

6 QHLLQLTVWGIKQLQARILAVERYLKDQ 

7 LRAIEAQQHLLQLTVWGIKQLQARILAV 

8 VQQQNNLLARIEAQQHLLQLTVWGIKQL 

9 RQLLSG I VQQQNNLLRAIEAQQHLLQLT 

1 0 MTLTVQARQLLSG I VQQQNNLLRAIEAQ 

12 WSLSNGVSVLTSKVLDLKNYIDKQLL 

13 LLSTNKAWSLSNGVSVLTSKVLDLKNY 

15 AC-VLHLEGEVNKIKSALLSTNKAVVSLSNG-NH2 

19 AC - LLS TNKAWS LSNGVS VLTSKVLDLKNY -NH2 

20 Ac - YTSL IHSL IEESQNQQEKNEQELLELDKWASLWNWF - NH2 

21 AC-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 

22 AC-IELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQST-NH2 

23 Ac - IELSNIKENKCNGTDAKVKLIKQELDKY-NH2 

24 Ac - ENKCNGTDAKVKL IKQELDKYKNAVTEL -NH2 

2 5 Ac -DAKVKLIKQELDKYKNAVTELQLLMQST-NH2 

26 Ac - CNGTDAKVKL IKQELDKYKNAVTELQLL -NH2 

27 Ac - SNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLL -NH2 

28 Ac -ASGVAVSKVLHLEGEVNKIKSALLSTNKAWSLSNGV-NH2 

29 Ac - SGVAVSKVLHLEGEVNKIKSALLSTNKAVVSLSNG -NH2 

30 Ac -VLHLEGEVNKIKSALLSTHKAWSLSNGVSVLTSK-NH2 

31 Ac -ARKLQRMKQLEDKVEELLSKNYHYLENEVARLKKLV-NH2 

32 Ac - RMKQLEDKVEELL S KNYH YLENEVARLKKL VGER - NH2 

3 3 Ac - VQQQNNLLRAIEAQQHLLQLTVWGIKQL -NH2 
34 Ac -LRAIEAQQHLLQLTVWGIKQLQARILAV -NH2 
3 5 Ac -QHLLQLTWGIKQLQARILAVERYLKDQ-NH2 
36 Ac -RQLLSGIVQQQNNLLRAIEAQQHLLQLT-NH2 
3 7 Ac -MTLTVQARQLLSGIVQQQNNLLRAIEAQ-NH2 
3 8 Ac - AKQARSDIEKLKEAIRDTNKAVQSVQSS -NH2 

3 9 Ac -AAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSS -NH2 

40 Ac - AKQARSD IEKLKEAIRDTNKAVQS VQS S IGNLIVA -NH2 

41 Ac -GTIALGVATSAQITAAVALVEAKQARSD -NH2 

42 Ac - ATSAQITAAVALVEAKQARSDIEKLKEA-NH2 

43 Ac -AAVALVEAKQARSDIEKLKEAIRDTNKANH2 

44 Ac - IEKLKEAIRDTNKAVQSVQSS IGNLIVA-NH2 

45 Ac - IRDTNKAVQS VQS S IGNLIVAIKS VQDY - NH2 

46 Ac - AVQS VQS S IGNLIVAIKS VQDYVNKE IV- NH2 

47 Ac - QARQLLSG IVQQQNNLLRAIEAQQHLLQLTVWG IKQLARILAVERYLKDQ - NH2 

48 AC - QARQLLSGIVQQQNNLLRAIEAQQHLLQ -NH2 

49 Ac-MTWMEMDREINNyTSLIGSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

50 AC -WMEWDREINNYTSLIGSLIEESQNQQEKNEQELLE -NH2 

51 AC - INNYTSLIGSLIEESQNQQEKNEQELLE -NH2 

52 AC - INNYTSL IGSLIEESQNQQEKNEQELLELDKWASL -NH2 

53 AC-EWDREINNYTSLIGSLIEESQNQQEKNEQEGGC-NH2 

54 AC - QSRTLLAGIVQQQQQLLDWKRQQELLR-NH2 
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T 

No. Sequence m _ a 

55 AC -NNDTWQEWERKVDFLEENITALLEEAQIQQEKNMYELQKLNSWD -NH2 

56 Ac - WQEWERKVDFLEENITALLEEAQIQQEK-NH2 

57 AC - VDFLEENITALLEEAQIQQEKNMYELQK-NH2 

58 Ac - ITALLEEAQIQQEKNMYELQKLNSWDVF-NH2 

59 AC - SSESFTLLEQWNNWKLQLAEQWLEQINEKHYLEDIS -NH2 

60 AC-DKWASLWNWF-NH2 

5 61 Ac -NEQELLELDKWASLWNWF-NH2 

62 Ac -EKNEQELLELDKWASLWNWF-NH2 

63 AC -NQQEKNEQELLELDKWASLWNWF-NH2 

64 AC -ESQNQQEKNEQELLELDKWASLWNWF-NH2 

65 AC-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

66 Ac -NDQKKLMSNNVQIVRQQSYS IMS I IKEE - NH2 

67 AC -DEFDASISQVNEKINQSLAFIRKSDELL-NH2 

68 AC-VSKGYSALRTGWYTSVITIELSNIKEN-NH2 
10 69 Ac - VVSLSNGVSVLTSKVLDLKNYIDKQLL-NH2 

70 Ac - WKIKSALLSTNKAWSLSNGVSVLTSK-NH2 

71 AC-PIINFYDPLVFPSDEFDASISQVNEKINQSLAFIR-NH2 

72 Ac -NLVYAQLQFTYDTLRGYINRALAQIAEA-NH2 

73 AC - LNQVDLTETLERYQQRLNTYALVS KDAS YRS -NH2 

74 AC -ELLVLKKAQLNRHSYLKDSDFLDAALD-NH2 

75 Ac -LAEAGEESVTEDTEREDTEEEREDEEE-NH2 

76 Ac -ALLAEAGEESVTEDTEREDTEEEREDEEEENEART-NH2 
15 77 AC - ETERS VDLVAALLAEAGEES VTEDTEREDTEEERE -NH2 

78 AC -EESVTEDTEREDTEEEREDEEEENEART-NH2 

79 AC - VDLVAALLAEAGEES VTEDTEREDTEEE -NH2 

80 AC -NSETERS VDLVAALLAEAGEES VTE-NH2 

81 Ac -DISYAQLQFTYDVLKDYINDALRNIMDA-NH2 

82 AC-SNVFSKDEIMREYNSQKQHIRTLSAKVKDN-NH2 

83 B i o t in - YTS L IHS L I EES QNQQEKNEQELLELDKWAS L WNWF - NH2 

84 Dig-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

2 q 85 Biotin-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 

86 Dig - NKLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ - NH2 

87 - AC -VLHQLNIQLKQYLETQERLLAGNRIAARQLLQIWKDVA-NH2 

88 Ac - LWHEQLLNTAQRAGLQLQLINQALAVREKVLIRYDIQK-NH2 
8 9 AC -LLDNFESTWEQSKELWEQQEIS IQNLHKSALQEYW-NH2 

90 Ac - LSNLLQISNNSDEWLEALEIEHEKWKLTQWQSYEQF -NH2 

91 Ac - KLEALEGKLEALEGKLEALEGKLEALEGKLEALEGK-NH2 

92 Ac - ELRALRGELRALRGELRALRGELRALRGK-NH2 

93 AC - ELKAKELEGEGLAEGEEALKGLLEKAAKLEGLELLK-NH2 
25 94 Ac - WEAAAREAAAREAAAREAAARA-NH2 

95 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNAF -NH2 

96 AC -YTSLIHSLIEESQNQQEKNEQELLELDKWASLANWF-KH2 

97 AC - YTSLIHSLIEESQNQQEKNQQELLELDKWASLWNWF -NH2 

98 Ac -YTSLIHSLIEESQNQQEKNEQELLQLDKWASLWNWF-NH2 

99 Ac - YTSLIHSLIEESQNQQEKNQQELLQLDKWASLWNWF-NH2 

100 AC -RMKQLEDKVEELLSKNYHLENEVARLKKLVGER-NH2 

101 AC - QQLLQLTVWG I KQLQARI LAVERYLKNQ - NH2 

3 0 102 AC - NEQELLELDKWASLWNWF -NH2 

103 Ac - YTSLIQSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

104 AC - 1 INFYDPLVFPSDEFDAS ISQVNEKINQSLAFIRK-NH2 
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No. Sequence 

105 Ac - INFYDPLVFPSDEFDASISQVNEKINQSLAFIRKS -NH2 

106 AC-NFYDPLVFPSDEFDASISQVNEKINQSLAFIRKSD-NH2 

107 AC- FYDPLVFPSDEFDAS ISQVNEKINQSLAFIRKSDE -NH2 

108 AC - YDPLVFPSDEFDAS ISQVNEKINQSLAFIRKSDEL -NH2 

109 Ac -DPLVFPSDEFDAS ISQVNEKINQSLAFIRKSDELL -NH2 

110 Ac - PLVFPSDEFDASISQVNEKINQSLAFIRKSDELLH -NH2 
5 111 Ac -LVFPSDEFDASISQVNEKINQSIAFIRKSDELLHN-NH2 

112 Ac - VFPSDE FDAS I S Q VNE KINQS LAF IRKS DELLHNV - NH2 

113 Ac -FPSDEFDASISQVNEKINQSLAFIRKSDELLHNVN-NH2 

114 AC - PSDEFDASISQVNEKINQSIAFIRKSDELLHNVNA-NH2 

115 Ac - SDEFDAS ISQVNEKINQSLAFIRKSDELLHNVNAG -NH2 

116 Ac - DEFDAS ISQVNEKINQSLAFIRKSDELLHNVNAGK- NH2 

117 AC - EFDAS ISQVNEKINQSLAFIRKSDELLHNVNAGKS -NH2 

118 Ac - FDAS ISQVNEKINQSIAFIRKSDELLHNVNAGKST-NH2 
10 119 Ac - DAS ISQVNEKINQSLAFIRKSDELLHNVNAGKSTT -NH2 

12 0 Ac - AS GVAVS KVLHLEGEVNKIKS ALLSTNKAWSLSN - NH2 

121 Ac - SGVAVSKVLHLEGEVNKIKSALLSTNKAVVSLSNG-NH2 

122 Ac - GVAVS KVLHLEGEVNKI KS ALLS TNKA WSLSNGV - NH2 

123 Ac -VAVS KVLHLEGEVNKI KSALLSTNKAVVSLSNGVS -NH2 

124 Ac -AVSKVLHLEGEVNKIKSALLSTNKAWSLSNGVSV-NH2 

125 Ac - VSKVLHLEGEVNKIKSALLSTNKAVVSLSNGVSVL -NH2 

126 Ac - SKVLHLEGEVNKIKSALLSTNKAVVSLSNGVS VLT-NH2 
15 127 Ac -KVLHLEGEVNKIKS ALLSTNKAWSLSNGVSVLTS -NH2 

12 8 Ac - VLHLEGETOKIKSALLSTNKAWSLSNGVSVLTSK-NH2 

12 9 Ac - LHLEGEVNKIKSALLSTNKAWSLSNGVS VLTSKV-NH2 

13 0 Ac -HLEGEVNKIKSALLSTNKAWSLSNGVSVLTSKVL-NH2 

13 1 Ac -LEGEVNKIKSALLSTNKAWSLSNGVSVLTSKVLD-NH2 

132 Ac - EGEVNKIKS ALLSTNKAWSLSNGVSVLTS KVLDL -NH2 

133 Ac -GEVNKIKSALLSTNKAWSLSNGVSVLTSKVLDLK-NH2 

134 AC-EVNKIKSALLSTNKAWSLSNGVSVLTSKVLDLKN-NH2 
20 135 AC-VNKIKSALLSTNKAWSLSNGVSVLTSKVLDLKNY-NH2 

136 AC-NKIKSALLSTNKAWSLSNGVSVLTSKVLDLKNYI-NH2 

137 AC-KIKSALLSTOKAWSLSNGVSVLTSKVLDLKNYID-NH2 
13 8 Ac - IKSALLSTNKAWSLSNGVSVLTSKVLDLKNYIDK-NH2 

13 9 Ac - KS ALLSTNKAWSLSNGVSVLTS KVLDL KNYIDKQ - NH2 

14 0 Ac - S ALLS TNKAWS LSNG VS VLTS KVLDLKNYIDKQL - NH2 

141 Ac - ALLSTNKAWSLSNGVS VLTSKVLDLKNYIDKQLL - NH2 

142 AC-YTSVITIELSNIKENKCNGTDAKVKLIKQELDKYK-NH2 
5 143 AC-TSVITIELSNIKENKCNGTDAKVKLIKQELDKYKN-NH2 

144 Ac -SVITIELSNIKENKCNGTDAKVKLIKQELDKYKNA-NH2 

14 5 AC - VIT IELSNIKENKCNGTDAKVKL IKQELDKYKNAV - NH2 

14 6 AC - ITIELSNIKENKCNGTDAKVKLIKQELDKYKNAVT-NH2 

147 AC - TIELSNIKENKCNGTDAKVKLIKQELDKYKNAVTE -NH2 

148 AC - IELSNIKENKCNGTDAKVKL IKQELDKYKNAVTEL - NH2 
14 9 AC - ELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQ -NH2 

150 Ac - LSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQL -NH2 

151 AC - SNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLL -NH2 
30 152 AC -NIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLM-NH2 

153 AC - 1 KENKCNGTDAKVKL IKQELDKYKNAVTELQLLMQ - NH2 

154 Ac - KENKCNGTDAKVKL I KQELDKYKNAVTELQLLMQS -NH2 
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155 Ac -ENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQST-NH2 

156 AC - LLDNFESTWEQS KELWELQE IS IQNLHKSALQEYWN - NH2 

157 AC-ALGVATSAQITAAVALVEAKQARSDIEKLKEAIRD-NH2 

158 AC - LGVATSAQITAAVALVEAKQARSDIEKLKEAIRDT -NH2 

159 Ac - GVATSAQ ITAAVALVEAKQARSD IEKLKEAIRDTN - NH2 

160 Ac - VATSAQITAAVALVEAKQARSDIEKLKEAIRDTNK-NH2 
5 161 Ac -ATSAQITAAVALVEAKQARSDIEKLKEAIRDTNKA-NH2 

162 AC - TS AQ ITAAVALVEAKQARSD IEKLKEAIRDTNKAV- NH2 

163 Ac - SAQ ITAAVALVEAKQARSD I EKLKEAIRDTNKAVQ-NH2 

164 Ac - AQ I TAAVALVEAKQ ARS D I EKLKEA I RDTNKAVQ S -NH2 

165 Ac -Q ITAAVALVEAKQARSD IEKLKEAIRDTNKAVQSV-NH2 

166 Ac - ITAAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQ-NH2 

167 AC - TAAVALVEAKQARS DIE KLKEA I RDTNKAVQ SVQ S -NH2 
166 Ac -AAVALVEAKQARSDIEKLKEAIRDTMKAVQSVQSS -NH2 

10 169 AC - AVALVEAKQARSDIEKLKEAIRDTNKAVQSVQS S I -NH2 

170 AC-VALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIG-NH2 

171 Ac -ALVEAKQARSDIEKLKEAIRDTNKAVQS VQS S IGN -NH2 

172 Ac -LVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNL-NH2 

173 Ac - VEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLI -NH2 

1 74 AC -EAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIV-NH2 

175 Ac - KQARSDIEKLKEAIRDTNKAVQSVQSS IGNLIVAI -NH2 

176 Ac - QARSDIEKLKEAIRDTNKAVQSVQSS IGNLIVAIK-NH2 
15 177 AC - ARSDIEKLKEAIRDTNKAVQS VQSS K5NLIVAIKS -NH2 

17 8 AC -RSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSV-NH2 

179 AC-SDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQ-NH2 

180 AC - D IEKLKEAIRDTNKAVQS VQ S S IGNL I VAIKS VQD -NH2 

181 Ac - IEKLKEAIRDTNKAVQS VQS S IGNLIVAI KS VQD Y - NH2 

182 AC - EKLKEAIRDTNKAVQSVQS S IGNLI VAIKS VQD YV-NH2 

183 AC-KLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVN-NH2 

184 Ac -LKEAIRDTNKAVQSVQSS IGNLIVAIKSVQDYVNK-NH2 
2 q 185 Ac - KEAIRDTNKA VQS VQS S IGNL IVA IKS VQD YVNKE - NH2 

186 Ac - EAI RDTNKAVQS VQ S S I GNL I VAI KS VQD YVNKE I -NH2 

187 Ac - AIRDTNKAVQS VQS S IGNL I VAIKS VQDYVNKE I V -NH2 

188 AC - 1 RDTNKAVQS VQSS IGNLI VAI KSVQD YVNKE IV- NH2 

189 Ac - YTPNDITLNNS VALD P ID I S IELNKAKSDLEES KE -NH2 

190 Ac - TPND ITLNNS VALDP ID I S IELNKAKSDLEES KEW -NH2 

191 Ac - PNDITLNNS VALDP ID IS IELNKAKSDLEESKEWI -NH2 

192 AC-NDITLNNSVALDPIDISIELNKAKSDLEESKEWIR-NH2 

193 AC-DITLNNSVALDPIDISIELNKAKSDLEESKEWIRR-NH2 
25 194 AC - ITLNNSVALDPIDIS IELNKAKSDLEESKEWIRRS -NH2 

195 AC -TLNNSVALDPIDISIELNKAKSDLEESKEWIRRSN-NH2 

196 AC - LNNS VALDPIDIS IELNKAKSDLEES KEWIRRSNQ-NH2 

197 AC -NNSVALDPIDISIELNKAKSDLEESKEWIRRSNQK-NH2 

198 AC-NSVALDPIDISIELNKAKSDLEESKEWIRRSNQKL-NH2 

200 Ac - SVALDPIDISIELNKAKSDLEESKEWIRRSNQKLD-NH2 

201 AC - VALDPIDISIELNKAKSDLEESKEWIRRSNQKLDS -NH2 
2 02 Ac - ALDPIDI S IELNKAKSDLEESKEWIRRSNQKLDS I -NH2 

30 203 AC -LDPIDIS IELNKAKSDLEESKEWIRRSNQKLDS IG-NH2 

204 Ac- DP IDIS IELNKAKSDLEES KEWIRRSNQKLDS IGN -NH2 

2 05 Ac - PIDISIELNKAKSDLEESKEWIRRSNQKLDSIGNW-NH2 
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206 AC -ID IS IELNKAKSDLEESKEWIRRSNQKLDS IGNWH-NH2 

207 AC -DISIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQ-NH2 

208 AC - IS IELNKAKSDLEESKEWIRRSNQKLDS IGNWHQS -NH2 

209 AC - S IELNKAKSDLEESKEWIRRSNQKLDS IGNWHQSS -NH2 

210 AC - IELNKAKSDLEE S KEWIRRSNQKLDS IGNWHQS ST - NH2 

211 Ac -ELNKAKSDLEES KEWIRRSNQKLDS IGNWHQSSTT-NH2 

2 12 AC - ELRALRGELRALRGELRALRGELRALRGELRALRGK-NH2 

213 AC - YTSLIHSLIEESQNQQQKNEQELLELDKWASLWNWF-NH2 

214 AC - YTS LIHSLIEES QNQQEKNEQELLELNKWASL WNWF - NH2 

215 AC - YTS L IHSL IEQS QNQQEKNEQELLELDKWAS LWNWF -NH2 

216 Ac - YTS L IHSLIQES QNQQEKNEQELLELDKWASLWNWF - NH2 

217 AC - YTSLIHSLIQQSQNQQQKNQQQLLQLNKWASLWNWF-NH2 

218 AC -EQELLELDKWASLWNWF-NH2 

219 AC -QELLELDKWASLWNWF-NH2 

220 Ac -ELLELDKWASLWNWF-NH2 

221 Ac -LELDKWAS LWNWF -NH2 

222 AC - ELDKWASLWNWF - NH2 

226 Ac -WASLWNWF-NH2 

227 Ac -ASLWNWF-NH2 

229 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLANAA-NH2 

230 Ac - YTSLIHSLIEESQNQQEKNEQQLLELDKWASLWNWF-NH2 

231 Ac - YTSLIQSLIEESQNQQEKNQQELLELDKWASLWNWF -NH2 
234 Ac - EAAAREAAAREAAARLELDKWASLWNWF -NH2 

236 Ac - PSLRDP IS AE I S I QALS YALGGDINKVLEKLGYSG - NH2 

237 Ac-SLRDPISAEISIQALSYALGGDINKVLEKLGYSGG-NH2 

238 Ac -LRDPISAEIS IQALSYALGGDINKVLEKLGYSGGD -NH2 

239 Ac - RDP I S AE I S I QAL S YALGGD INKVL E KL G YS GGDL - NH2 

240 Ac - DPISAE IS IQALS YALGGD INKVLEKLGYSGGDLL-NH2 

241 Ac -PISAEISIQALS YALGGD INKVLEKLGYSGGDLLG -NH2 

242 AC - ISAEIS IQALS YALGGD INKVLEKLGYSGGDLLGI -NH2 

243 AC-SAEISIQALSYALGGDINKVLEKLGYSGGDLLGIL-NH2 

244 Ac - AE I S I QALS YALGGD INKVLEKLG YS GGDLLG I LE - NH2 

245 Ac -EISIQALSYALGGDINKVLEKLGYSGGDLLGILES -NH2 
24 6 Ac - 1 S I QAL S YALGGD INKVLEKLGYSGGDLLG ILESR - NH2 
247 Ac -SIQALSYALGGDINKVLEKLGYSGGDLLGILESRG-NH2 
24 8 Ac - IQALS YALGGD INKVLEKLGYSGGDLLGILESRGI -NH2 
24 9 Ac- QALS YALGGD INKVLEKLGYSGGDLLG ILESRGIK-NH2 

250 Ac- ALS YALGGD INKVLEKLGYSGGDLLGILESRGIKA-NH2 

251 AC - LSYALGGDINKVLEKLGYSGGDLLGILESRGIKAR-NH2 

252 AC - PDAVYLHRIDLGPPI SLERLDVGTNLGNAIAKLED -NH2 

253 Ac - D AVYLHR IDLGPPIS LERLD VGTNLGNA IAKLED A - NH 2 

254 AC- AVYLHRIDLGPPISLERLDVGTNLGNAIAKLEDAK-NH2 

255 AC - VYLHRIDLGPPISLERLDVGTNLGNAIAKLEDAKE -NH2 

256 Ac - YLHRIDLGPPISLERLDVGTNLGNAIAKLEDAKEL-NH2 

257 Ac - LHR IDLGP P I S LERLD VGTNLGNAIAKLEDAKELL - NH2 

258 Ac - HRIDLGPP IS LERLD VGTNLGNAIAKLEDAKELLE - NH2 

259 Ac -RIDLGPPISLERLDVGTNLGNAIAKLEDAKELLES -NH2 
2 60 AC - IDLGPP I S LERLD VGTNLGNAIAKLEDAKELLES S -NH2 
261 Ac - DLGPP I SLERLDVGTNLGNAIAKLEDAKELLES SD - NH2 
2 62 AC-LGPPISLERLDVGTNLGNAIAKLEDAKELLESSDQ-NH2 
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263 Ac -GPPISLERLDVGTNLGNAIAKLEDAKELLESSDQI -NH2 

264 AC - PP I SLERLDVGTNLGNAIAKLEDAKELLES SDQIL-NH2 

265 Ac - P I SLERLDVGTNLGNAIAKLEDAKELLES SDQ ILR - NH2 

266 AC - 1 SLERLDVGTNLGNAIAKLEDAKELLES SDQ IRS -NH2 

267 AC - SLERLDVGTNLGNAIAKLEDAKELLES SDQILRSM - NH2 

268 Ac - LERLDVGTNLGNAIAKLEDAKELLES SDQ I LRSMK - NH2 
5 269 Ac -EWIRRSNQKLDS I -NH2 

270 Ac - LELDKWAS LANAF -NH2 

271 Ac - LELDKWASLFNFF -NH2 

272 Ac - LELDKWAS LANWF -NH2 

273 AC - LELDKWASLWNAF -NH2 

274 AC -ELGNVNNSISNALDKLEESNSKLDKVNVKLTSTSA-NH2 

275 Ac -TELGNVNNS I SNALDKLEESNSKLDKVNVKLTSTS -NH2 

276 AC-STELGNVNNSISNALDKLEESNSKLDKVNVKLTST-NH2 
10 277 Ac - 1 S TELGNVNNS I SNALDKLEESNS KLDKVNVKLTS -NH2 

2 78 Ac -DISTELGNVNNS I SNALDKLEESNSKLDKVNVKLT -NH2 

279 AC-LDISTELGNVNNSISNALDKLEESNSKLDKVNVKL-NH2 

2 80 Ac -NLDISTELGNVNNSISNALDKLEESNSKLDKVNVK-NH2 

281 AC-GNLDISTELGNVNNSISNALDKLEESNSKLDKVNV-NH2 

2 82 AC-TGNLDISTELGNVNNSISNALDKLEESNSKLDKVN-NH2 

283 AC -VTGNLDISTELGNVNNS ISNALDKLEESNSKLDKV-NH2 

2 84 Ac - IVTGNLDISTELGNVNNSISNALDKLEESNSKLDK-NH2 

15 2 85 AC-VIVTGNLDISTELGNVNNSISNALDKLEESNSKLD-NH2 

2 86 Ac -QVIVTGNLDISTELGNVNNSISNALDKLEESNSKL-NH2 

2 87 Ac - S Q VI VTGNLD I S TELGNVNNS ISNALDKLEESNSK-NH2 

2 88 AC -DSQVIVTGNLDISTELGNVNNS ISNALDKLEESNS -NH2 

2 89 Ac -LDSQVIVTGNLDISTELGNVNNSISNALDKLEESN-NH2 

2 90 AC - ILDSQVIVTGNLDISTELGNVNNSISNALDKLEES -NH2 

291 Ac - SILDSQVIVTGNLDISTELGNVNNSISNALDKLEE -NH2 

2 92 AC-ISILDSQVIVTGNLDISTELGNVNNSISNALDKLE-NH2 

2 q 293 Ac -NIS ILDSQVIVTGNLDISTELGNVNNS ISNALDKL -NH2 

294 . Ac - KNI S ILDSQVIVTGNLDISTELGNVNNS ISNALDK-NH2 

2 95 Ac -QKNISILDSQVIVTGNLDISTELGNVNNSISNALD -NH2 

296 Ac -YQKNIS ILDSQVIVTGNLDISTELGNVNNS ISNAL-NH2 

2 97 Ac -TYQKNIS ILDSQVIVTGNLDISTELGNVNNSISNA-NH2 

298 AC-ATYQKNISILDSQVIVTGNLDISTELGNVNNSISN-NH2 

299 AC-DATYQKNISILDSQVIVTGNLDISTELGNVNNSIS-NH2 

3 00 Ac - FDATYQKNISILDSQVIVTGNLDISTELGNVNNS I -NH2 
25 301 Ac -EFDATYQKNIS ILDSQVIVTGNLDISTELGNVNNS -NH2 

302 AC -GEFDATYQKNISILDSQVIVTGNLDISTELGNVNN-NH2 

303 Ac -SGEFDATYQKNISILDSQVIVTGNLDISTELGNVN-NH2 
3 04 Ac -LSGEFDATYQKNISILDSQVIVTGNLDISTELGNV-NH2 

305 AC-RLSGEFDATYQKNISILDSQVIVTGNLDISTELGN-NH2 

306 Ac -LRLSGEFDATYQKNISILDSQVIVTGNLDISTELG-NH2 

307 Ac -TLRLSGEFDATYQKNISILDSQVIVTGNLDISTEL -NH2 
3 08 Ac - ITLRLSGEFDATYQKNISILDSQVTVTGNLDISTE-NH2 
309 AC-GITLRLSGEFDATYQKNISILDSQVIVTGNLDIST-NH2 

30 310 Ac -TATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFNNT-NH2 

311 Ac - ITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFNN -NH2 

312 AC -SITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFN-NH2 
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Sequence 

Ac -KESITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQ-NH2 
AC - LKES ITATIEAVHEVTDGLSQLAVAVGKMQQFVND -NH2 
Ac - RLKE SITAT I EAVHEVTDGL S QLAVAVGKMQQFVN -NH2 
Ac - LRLKES ITATI E AVHEVTDGLSQLAVAVGKMQQFV -NH2 
Ac - ILRLKESITATIEAVHEVTDGLSQLAVAVGKMQQF-NH2 
Ac - N ILRLKES ITATIEAVHEVTDGLSQLAVAVGKMQQ -NH2 
Ac - ANILRLKES ITATIEAVHEVTDGLS QLAVAVGKMQ -NH2 
Ac -AANILRLKESITATIEAVHEVTDGLSQLAVAVGKM-NH2 
Ac -HKCDDECMNSVKNGTYDYPKYEEESKLNRNEIKGV-NH2 
Ac - KCDDECMNSVKNGTYDYPKYEEESKLNRNEIKGVK-NH2 
Ac - CDDECMNS VKNGTYD YPKYEEESKLNRNE IKGVKL -NH2 
Ac - DDECKNSVKNGTYDYPKYEEESKLNRNEIKGVKLS -NH2 
Ac OECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSS-NH2 
Ac -ECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSM-NH2 
AC-CMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMG-NH2 
Ac -MNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGV-NH2 
AC-NSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVY-NH2 
Ac - S VKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQ -NH2 
Ac - VKNGTYPYPKYEEESKLNRNEIKGVKLSSMGVYQI -NH2 
AC - KNGTYDYPKYEEESKLNRNE IKGVKLSSMGVYQIL -NH2 
Ac - AFIRKSDELLHNV-NH2 

Ac - WLAGAALGVATAAQ ITAG I ALHQ SMLNS QAIDNL - NH2 
Ac - VLAGAALGVATAAQITAGIALHQSMLNSQAIDNLR -NH2 
Ac - LAGAALGVATAAQ I TAG IALHQSMLNS QAIDNLRA - NH2 
Ac -AGAALGVATAAQITAGIALHQSMLNSQAIDNLRAS -NH2 
AC - GAALGVATAAQITAGIALHQSMLNS QAIDNLRASL -NH2 
Ac - AALGVATAAQITAGIALHQSMLNSQAIDNLRASLE - NH2 
Ac - ALGVATAAQ ITAG IALHQSMLNSQAIDNLRAS LET - NH2 
AC - LGVATAAQITAGIALHQSMLNS QAIDNLRASLETT -NH2 
Ac - GVATAAQITAG IALHQSMLNSQAIDNLRAS LETTN-NH2 
Ac - VATAAQ ITAG I ALHQS MLNS QAIDNLRASLETTNQ - NH2 
Ac - ATAAQ ITAG I ALHQSMLNSQAIDNLRASLETTNQA - NH2 
Ac -TAAQITAG IALHQSMLNSQAIDNLRAS LETTNQAI -NH2 
AC-AAQITAGIALHQSMLNSQAIDNLRASLETTNQAIE-NH2 
AC-AQITAGIALHQSMLNSQAIDNLRASLETTNQAIEA-NH2 
Ac -Q ITAG IALHQSMLNS QAIDNLRAS LETTNQAIEAI -NH2 
Ac - ITAGIALHQSMLNSQAIDNLRASLETTNQAIEAIR-NH2 
Ac - TAG IALHQS MLNS QAIDNLRAS LETTNQAIEAIRQ - NH2 
Ac - AG IALHQSMLNSQAIDNLRASLETTNQAIEAIRQA - NH2 
Ac - GIALHQSMLNSQAIDNLRASLETTNQAIEAIRQAG -NH2 
Ac - IALHQ SMLNS QAIDNLRAS LETTNQAI EAIRQAGQ-NH2 
AC - ALHQSMLNSQAIDNLRASLETTNQAIEAIRQAGQE -NH2 
AC - LHQSMLNSQAIDNLRAS LETTNQAIEAIRQAGQEM - NH2 
AC - HQSMLNSQAIDNLRAS LETTNQAI EAIRQAGQEMI -NH2 
AC-QSMLNSQAIDNLRASLETTNQAIEAIRQAGQEMIL-NH2 
Ac - SMLNSQAIDNLRASLETTNQAIEAIRQAGQEMILA-NH2 
Ac -MLNS QAIDNLRAS LETTNQAIEAIRQAGQEMILAV-NH2 
Ac - LNSQAIDNLRASLETTNQAIEAIRQAGQEMILAVQ-NH2 
Ac - NS QAIDNLRAS LETTNQAI EAIRQAGQEMILAVQG -NH2 
AC - SQAIDNLRASLETTNQAIEAIRQAGQEMILAVQGV-NH2 
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3 64 AC - QAIDNLRAS LETTNQAI EA IRQAGQEM I LAVQGVQ - NH2 

365 AC - AIDNLRASLETTNQAIEAIRQ AGQEMILAVQGVQD -NH2 

366 Ac - IDNLRASLETTNQAIEAIRQAGQEMILAVQGVQDY-NH2 

367 AC -DNLRASLETTNQAIEAIRQAGQEMILAVQGVQDYI -NH2 

368 Ac - NLRAS LETTNQAIEAIRQAGQEMILAVQGVQD YIN-NH2 

369 AC - LRASLETTNQAIEAIRQAGQEMILAVQGVQDYINN-NH2 

370 Ac - RASLETTNQAIEAIRQAGQEM ILAVQGVQDYINNE -NH2 

371 AC-YTSVITIELSNIKENKUNGTDAVKLIKQELDKYK-NH2 

372 Ac -TSVITIELSNIKENKUNGTDAVKLIKQELDKYKN-NH2 

373 Ac - S VITIELSNIKENKUNGTDAVKLIKQELDKYKNA-NH2 

374 Ac - SNIKENKUNGTDAKVKLIKQELDKYKNAVTELQLL -NH2 

375 AC - KENKUNGTDAKVKLI KQELDKYKNAVTELQLLMQS -NH2 

376 Ac CLELDKWASLWNWFC -NH2 

377 AC - CLELDKWASLANWFC -NH2 

378 Ac - CLELDKWASLFNFFC -NH2 

37 9 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLFNFF -NH2 

381 AC - RMKQLEDKVEELLSKNYHLENELELDKWASLWNWF -NH2 

3 82 " AC-KVEELLSKNYHLENELELDKWASLWNWF-NH2 

383 Ac -RMKQLEDKVEELLSKLEWIRRSNQKLDSI -NH2 

384 Ac -RMKQLEDKVEELLSKLAFIRKSDELLHNV-NH2 

385 Ac - ELEALRGELRALRGELELDKWASLWNWF-NH2 

386 Ac - LDP ID I S IELNKAKSDLEES KEWIRRSNQKLD S I -NH2 

387 Ac - CNEQLSDSFPVEFFQV-NH2 

388 Ac -MAEDDPYLGRPEQMFHLDPSL-NH2 

389 Ac - EDFSSIADMDFSALLSQISS -NH2 

390 Ac -TWQEWERKVDFLEENITALLEEAQIQQEKNMYELQ-NH2 

391 Ac - WQEWERKVDFLEENITALLEEAQIQQEKNMYELQK-NH2 
3 92 Ac -QEWERKVDFLEENITALLEEAQIQQEKNWYELQKL-NH2 
3 93 Ac -EWERKVDFLEENITALLEEAQIQQEKNMYELQKLN-NH2 
394 Ac - WERKVDFLEENITALLEEAQIQQEKNMYELQKLNS -NH2 
3 95 AC-ERKVDFLEENITALLEEAQIQQEKNMYELQKLNSW-NH2 
396 Ac -RKVDFLEENITALLEEAQIQQEKNMYELQKLNSWD-NH2 
3 97 , AC-KVDFLEENITALLEEAQIQQEKNMYELQKLNSWDV-NH2 
3 98 Ac - VDFLEENITALLEEAQIQQEKNMYELQKLNSWDVF-NH2 

3 99 AC -DFLEENITALLEEAQIQQEKNMYELQKLNSWDVFG-NH2 
400 AC - FLEENITALLEEAQIQQEKNMYELQKLNSWDVFGN -NH2 

4 01 AC - LEENITALLEEAQIQQEKNMYELQKLNSWDVFGNW-NH2 
4 02 Ac - LEENITALLEEAQIQQEKNMYELQKLNS WDVFGNWF -NH2 
403 AC -NEQSEEKENELYWAKEQLLDLLFNIFNQTVGAWIMQ-NH2 

4 05 Ac - QQQLLDWKRQQELLRLTVWGTKNLQTRVTAIEKYLKD -NH2 

4 06 Ac -QQLLDWKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQ-NH2 

407 AC - QQLLDVVKRQQELLRLTVWGPKNLQTRVTAIEKYLKDQ-NH2 

408 AC - DERKQD KVL WQQTGTLQLTL I QLEKTAKLQWVRLNRY - NH2 

409 Ac -QQQLLDWKRQQELLRLTVWGTKNLQTRVTAIEKY-NH2 

410 Ac - QQLLD WKRQQELLRLTVWGTKNLQTRVTAIEKYL -NH2 

411 AC-QLLDWKRQQELLRLTVWGTKNLQTRVTAIEKYLK-NH2 

412 Ac - LLDWKRQQELLRLTVWGTKNLQTRVTAIEKYLKD -NH2 

413 Ac -LDWKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQ-NH2 

4 14 AC -DWKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQA-NH2 

415 Ac - WKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQ -NH2 
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416 AC -VKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQL-NH2 

417 AC-KRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQLN-NH2 

418 Ac - RQQELLRLTVWGTKNLQTRVTAI EKYLKDQAQLNA -NH2 

419 Ac -QQELLRLTWGTKNLQTRVTAIEKYLKDQAQLNAW-NH2 

420 Ac - QELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAWG - NH2 

421 Ac - ELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAWGC -NH2 

422 AC -NNLLRAIEAQQHLLQLTVWGPKQLQARILAVERYLKDQ-NH2 

423 Ac -SELEIKRYKNRVASRKCRAKFKQLLQHYREVAAAK-NH2 

424 Ac - ELE I KRYKNRVASRKCRAKFKQLLQHYREVAAAKS -NH2 

425 Ac - LE IKRYKNRVASRKCRAKFKQLLQHYREVAAAKS S -NH2 

426 Ac -EIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSE -NH2 

427 Ac - 1 KRYKNRVAS RKCRAKFKQLLQHYRE VAAAKS SEN-NH2 

42 8 Ac - KRYKNRVAS RKCRAKFKQLLQHYREVAAAKS S END -NH2 

429 AC - RYKNRVASRKCRAKFKQLLQHYREVAAAKS S ENDR - NH2 

430 Ac - YKNRVASRKCRAKFKQLLQHYREVAAAKSSENDRL -NH2 
4 31 Ac - KNRVASRKCRAKFKQLLQHYREVAAAKS SENDRLR - NH2 

432 Ac - NRVAS RKCRAKFKQLLQHYREVAAAKS SENDRLRL - NH2 

433 AC - RVAS RKCRAKFKQLLQHYREVAAAKS S ENDRLRLL - NH2 

434 Ac - VAS RKCRAKFKQLLQHYREVAAAKS S ENDRLRLLL - NH2 

435 Ac - ASRKCRAKFKQLLQHYREVAAAKS SENDRLRLLLK-NH2 

43 6 Ac - SRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQ -NH2 
437 AC - RKCRAKFKQLLQHYREVAAAKS SENDRLRLLLKQM - NH2 

43 8 Ac - KCRAKFKQLLQHYRE VAAAKS SENDRLRLLLKQMC -NH2 

439 Ac - CRAKFKQLLQHYRE VAAAKS S ENDRLRLLLKQMCP -NH2 

440 Ac - RAKFKQLLQHYREVAAAKS SENDRLRLLLKQMCPS -NH2 

441 Ac -AKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSL-NH2 

442 Ac - KFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLD -NH2 

443 Ac - FKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDV-NH2 

444 AC - KQLLQHYREVAAAKS SENDRLRLLLKQMCPS LDVD -NH2 

445 Ac - QLLQHYRE VAAAKS S ENDRLRLLLKQMCP SLDVDS -NH2 

44 6 Ac -LLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSI -NH2 
44 7 Ac - LQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS II - NH2 
44 8 Ac - QHYREVAAAKS SENDRLRLLLKQM CP SLDVDS I IP -NH2 
44 9 AC-HYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPR-NH2 
4 50 Ac -YREVAAAKS SENDRLRLLLKQMC PS LDVDS I IPRT-NH2 
4 51 Ac - REVAAAKS SENDRLRLLLKQMC PS LDVDS I IPRTP - NH2 

452 AC-EVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD-NH2 

453 AC -VAAAKS SENDRLRLLLKQMCPS LDVDS I IPRTPDV-NH2 
4 54 Ac - AAAKS SENDRLRLLLKQMCPSLDVDS I IPRTPDVL -NH2 

455 Ac - AAKS S ENDRLRLLLKQMC P S LDVDS 1 1 PRTPDVLH - NH2 

456 Ac -AKS SENDRLRLLLKQMCPSLDVDS 1 1 PRTPDVLHE-NH2 

457 AC - KS S ENDRLRLLLKQMCP S LDVDS 1 1 PRTPD VLHED -NH2 

458 Ac - S SENDRLRLLLKQMCPS LDVDS 1 1 PRTPDVLHEDL - NH2 

459 Ac - S ENDRLRLLLKQMCPSLDVDS 1 1 PRTPDVLHEDLL - NH2 
4 60 Ac - ENDRLRLLLKQMCPSLDVDS 1 1 PRTPDVLHEDLLN-NH2 
461 AC - NDRLRLLLKQMCP SLDVDS 1 1 PRTPD VLHEDLLNF - NH2 
534 AC - PG YRWMCLRRF 1 1 FLF ILLLCL I FLLVLLDYQGML - NH2 
53 5 Ac -GYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLP -NH2 

536 Ac - YRWMCLRRF I IFLF ILLLCL I FLLVLLDYQGMLP V - NH2 

537 AC-RWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVC-NH2 
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538 AC-WMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCP-NH2 

53 9 AC-MCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPL-NH2 

540 Ac-CLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPLI -NH2 

541 Ac - LRRFI IFLF ILLLCL I FLLVLLD YQGMLP VCPLI P -NH2 

542 AC - RRFI IFLF I LLLCL I FLLVLLD YQGMLP VCPL I PG -NH2 

543 Ac -RFIIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGS -NH2 

544 Ac-FIIFLFILLLCLIFLLVI*LDYQGMLPVCPIiIPGSS-NH2 

545 Ac - IIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSST-NH2 

546 Ac - 1 FLFILLLCL IFLLVLLDYQGMLPVCPLI PGS STT -NH2 

547 Ac-FLFILLLCLIFLLVLIJ>YQGMLPVCPLIPGSSTTS-NH2 

548 Ac -LF ILLLCL IFLLVLLDYQGMLPVCPLI PGS STTST-NH2 

549 Ac -F ILLLCL IFLLVLLDYQGMLPVCPLI PGS STTSTG-NH2 

550 Ac - ILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGP-NH2 

551 Ac - LLLCL I FLLVLLD YQGMLP VCPL I PGS STTS TGP C - NH2 

552 AC-LLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCR-NH2 

553 Ac - LCL I FLLVLLD YQGMLP VCPL I PGS STTSTGPCRT - NH2 ' 

554 AC - CLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTC -NH2 

555 Ac - LIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCM - NH2 

556 Ac - IFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCMT-NH2 

557 Ac - FLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCMTT -NH2 

558 Ac - PPLVLQAGFFLLTRILTIPQSLDSWWTSLNPLGGT-NH2 

559 AC-LLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTT-NH2 

560 AC-LVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTV-NH2 

561 Ac - VLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVC-NH2 

562 Ac - LQAGFFLLTRILT I PQS LDS WWTS LNFLGGTTVCL - NH2 

563 AC-QAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLG-NH2 

564 AC-AGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQ-NH2 

565 Ac -GFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQN-NH2 

566 Ac -FFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNS -NH2 

567 AC -FLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQ-NH2 

568 AC - LLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQS -NH2 

569 Ac - LTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQSP -NH2 
57 0 AC-FWNWLSAWKDLELKSLLEEVKDELQKMR-NH2 

571 Ac -NNLLRAIEAQQHLLQLTVW -NH2 

572 AC - CGGNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ -NH2 

573 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF - NH2 

574 CI 3H2 7CO - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

575 Ac - AVS KG YLS ALRTGWYTS VIT IELSNI KENKUKGTD A - KH2 

576 AC - S ISNIETVIEFQQKNNRLLE ITREFS VNAGVTTPVS -NH2 

577 Ac - DQQI KQ YKRLLDRLI I PLYDGLRQKDVIVSNQESN - NH2 

578 Ac - YSELTNIFGDNIGSLQEKGIKLQGIASLYRTNITEI -NH2 

579 Ac -TS ITLQVRLPLLTRLLNTQIYRVDS ISYNIQNREWY-NH2 

580 Ac - VE IAEYRRLLRTVLEP IRDALNAMTQNIRPVQS VA -NH2 

581 Ac - S YF I VLS IAYPTL SE I KG VI VHRLEGVS YNIGS QEW -NH2 

582 Ac - LKEAIRDTNKAVQS VQSS IGNLIVAIKS -NH2 

583 NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ -NH2 

583 NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 

584 QKQEPIDKELYPLTSL 

585 YPKFVKQNTLKLAT 

586 QYIKANQKFIGITE 
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587 NGQIGNDPNRDILY 

588 AC-RPDVY-OH 

589 CLELDKWASLWNWFO (cyclic) 

590 CLELDKWASLANWFC- (cyclic) 

591 CLELDKWASLANFFC- (cyclic) 

594 AC-NNLLRAIEAQQQHLLQLTWGIKQLQARILAVERYLKDQ-NH2 

5 595 AC-CGGYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNNWF-NH2 

596 Ac - PLLVLQAGFFLLTRILTIPQS LDS WWTSLNFLGGT-NH2 

597 Ac - LL VLQAGFFLLTRILTIPQSLDS WWTSLNFLGGTT -NH2 

598 AC - LVLQAGFFLLTRILTIPQSLDS WWTSLNFLGGTTV - NH2 

599 Ac - VLQAGFFLLTRILTIPQSLDS WWTSLNFLGGTTVC -NH2 

600 AC-LQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCL-NH2 

601 Ac - QAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLG -NH2 

602 AC - AGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQ -NH2 
10 603 Ac -GFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQN-NH2 

604 Ac - FFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNS -NH2 

605 AC-FLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQ-NH2 

606 Ac - LLTRILTI PQSLDSWWTSLNFLGGTTVCLGQNS QS -NH2 

607 Ac - LTRILT IPQSLDS WWTSLNFLGGTTVCLGQNS QSP - NH2 

608 AC-LELDKWASLWNWA-NH2 

609 AC -LELDKWASAWNWF-NH2 

610 AC - LELDKAAS LWNWF -NH2 
15 611 Ac -LKLDKWAS LWNWF -NH2 

612 Ac -LELKKWAS LWNWF -NH2 

613 Ac -DELLHNVNAGKST-NH2 

614 AC - KSDELLHNVNAGKST -NH2 

615 AC - IRKSDELLHNVNAGKST -NH2 

616 AC - AF IRKSDELLHNVNAGKST - NH2 

617 Ac - FDAS ISQVNEKINQSLAFI-NH2 

618 AC - YAADKESTQKAFDGITNKVNS VIEKMNTQFEAVGKE -NH2 
2 0 619 AC - S VIEKMNTQFEAVGKEFGNLERRLENLNKRMEDGFL -NH2 

620 AC-VWTYNAELLVLMENERTLDFHDSNVKNLYDKVRMQL-NH2 

621 AC -EWDRE INNYTSL IHS L IEES QNQQEKNEQEGGC -NH2 

622 AC - INNYTSLIHSLIEESQNQQEKNEQELLELDKWASL -NH2 

623 Ac - INNYTS L IHS L IEESQNQQEKNEQELLE - NH2 

624 AC -WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE -NH2 

625 AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

626 AC - ID IS IELNKAKSDLEESKE WIKKSNQKLDS IGNWH -NH2 

627 Ac -NQQEKNEQELLELDKWASLWNWFNITNWLWYIKIFI -NH2 
627 Ac -NQQEKNEQELLELDKWASLWNWFNITNWLWYIKIFI -NH2 

62 8 Ac - QNQQEKNEQELLELDKWASLWNWFNITNWLWYIKI F -NH2 

629 Ac -SQNQQEKNEQELLELDKWASLWNWFNITNWLWYIKI -NH2 

630 Ac -ESQNQQEKNEQELLELDKWASLWNWFNITNWLWYIK-NK2 

631 Ac -EESQNQQEKNEQELLELDKWASLWNWFNITNWLWYI -NH2 

632 AC - IEESQNQQEKNEQELLELDKWASLWNWFNITNWLWY-NH2 

633 Ac -LIEESQNQQEKNEQELLELDKWASLWNWFNITNWLW-NH2 

634 Ac - SLIEESQNQQEKNEQELLELDKWASLWNWFNITNWL -NH2 
30 635 AC -HSLIEESQNQQEKNEQELLELDKWASLWNWFNITNW-NH2 

63 6 AC-IHSLIEESQNQQEKNEQELLELDKWASLWNWFNITN-NH2 
63 7 AC -LIHSLIEESQNQQEKNEQELLELDKWASLWNWFNIT-NH2 
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638 




639 




640 




641 




642 




643 


5 


644 




645 




646 




647 




648 




649 




650 




651 


10 


652 




653 




654 




655 




656 




657 




658 




659 


15 


660 




661 




662 




663 




664 




665 




666 




667 


20 


668 




669 




670 




671 




672 




673 




674 




675 




676 


Z D 


677 




678 




679 




680 




681 




682 




683 




684 


30 


685 




686 




687 



Sequence 



AC - SLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI -NH2 
AC-TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2 
AC-NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW-NH2 
AC-NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-NH2 
AC - INNYTSLIHSLIEESQNQQEKNEQELLELDKWASLW-NH2 
AC-EINNYTSLIHSLIEESQNQQEKNEQELLELDKWASL-NH2 
AC -REINNYTSLIHSLIEESQNQQEKNEQELLELDKWAS -NH2 
AC-DREINNYTSLIHSLIEESQNQQEKNEQELLEI.DKWA-NH2 
Ac-WDREINNYTSIjIHSLIEESQNQQEKNEQELLELDKW-NH2 
AC-EWDREINNYTSLIHSLIEESQNQQEKNEQELLELDK-NH2 
Ac - ME WDRE INNYTS LIHSL IEESQNQQEKNEQELLELD - NH 
AC - WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 
AC-TWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2 
Ac - MT WMEWDRE INNYTS L IHS L IEESQNQQEKNEQELL - NH2 
Ac -NMTWMEWDRE INNYTS LIHSLIEESQNQQEKNEQEL -NH2 
AC-NNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2 
AC-WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQ-NH2 
AC - IWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNE-NH2 
Ac -QIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKN-NH2 
AC-EQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEK-NH2 
Ac -LEQIWNNMTWMEWDRE INNYTS LIHSLIEESQNQQE-NH2 
AC-SLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQ-NH2 
AC-KSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQ-NH2 
Ac -NKS LEQ IWNNMT WMEWDRE INNYTS L IHSL I EES QN - NH2 
Ac - SLAFIRKSDELLHNVNAGKST-NH2 
Ac - FDASISQVNEKINQSLAFIRK-NH2 

AC-YTSLIHSLIEESQQQQEKQEQELLELDKWASLWNWF-NH2 

AC-FDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2 

AC - FDAS ISQVNEKINQSLAFIRKSDELLHNVNA-NH2 

AC-FDASISQVNEKINQSLAFIRKSDELLHNV-NH2 

AC - FDAS ISQVNEKINQSLAFIRKSDELLH-NH2 

AC - FDAS ISQVNEKINQSLAFIRKSDEL - NH2 

Ac - FDAS I SQVNEKINQSLAFIRKSD -NH2 

AC-ASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 

AC - ISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 

Ac - Q VNEKINQS LAF IRKSDELLHNVNAGKS T - NH2 

Ac -NEKINQSLAFIRKSDELLHNVNAGKST-NH2 

AC - KINQSLAFIRKSDELLHNVNAGKST -NH2 

AC -NQSLAFIRKSDELLHNVNAGKST-NH2 

Ac - FWNWLS AWKDLEL YPGS LELDKWAS LWNWF -NH2 

Ac - CGGNNLLRAIEAQQHLLQLTWG IKQLQARILAVERYLKDQ -NH2 

AC-CGGYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 

NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 

AC - EKNMYELQKLNS WDVFTNWLDFTSWVRYIQYIQYGV-NH2 

AC - QEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQYG-NH2 

Ac - QQEKNMYELQKLNSWDVFTNWLDFTS WVRYIQYIQY -NH2 

Ac - IQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQ-NH2 

AC -QIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYI-NH2 

AC - AQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQY-NH2 
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688 AC-QAQIQQEKNOTELQKLNSWDVFTNWLDFTSWVRYIQ-NH2 

689 AC - EQAQIQQEKNMYELQKLNSWDVFTNWLDFTS WVRYI -NH2 

690 AC-LEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRY-NH2 

691 AC - SLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTS WVR-NH2 

692 AC - QSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWV-NH2 

693 AC -SQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSW-NH2 
5 694 AC - ISQSLEQAQIQQEKNMYELQKLNSWDVFTNWIiDFTS -NH2 

695 AC - NIS QS LEQAQ IQQEKNMYELQKLNS WD VFTOWLD FT -NH2 

696 Ac -ANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDF-NH2 

697 AC - EANI SQS LEQAQ IQQEKNMYELQKLNS WDVFTNWLD -NH2 

699 AC - YLEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNW-NH2 

700 AC-YTSLIHSLIEESQNQQEKNEQEL-NH2 

701 AC-YTSLIHSLIEESQNLQEKUEQELLELDKWASLWNWF-NH2 

702 AC - YTSLIHSLIEESQNQQEKLEQELLELDKWASLWNWF -NH2 
10 703 Ac - YTS L IHSL IEES QNQQEKNEQELLEFDKWAS LWNWF -NH2 

704 AC - YTSLIHSLIEESQNQQEKNEQELLELDKPASLWNWF-NH2 

705 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASPWNWF-NH2 

706 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNSF-NH2 

707 Biotin NH { CH2 ) 4CO-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

708 Biotin NH(CH2) 6C0-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2. 

709 FMOC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 

710 FMOC -NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 
15 711 Ac -EWDREINNYTSLIHSLIEESQNQQEXNEQE -NH2 

712 Ac - L IEESQNQQEKNEQELLELDKWASLWNWF -NH2 

713 Ac - FWNWLSAWKDLELGGPGSGPGGLELDKWASLWNWF-NH2 

714 AC-LIHSLIEESQNQQEKNEQELLELDKWASL-NH2 

715 Ac - TS L IHS L IEES QNQQEKNEQELLELD KWAS LWNWF - NH2 

716 Ac -L IHSL IEES QNQQEKNEQELLELDKWASLWNWF-NH2 

718 FMOC - GGGGG YTS L IHSL I EESQNQQEKNEQELLELDKWASLWNWF - NH2 

719 AC -HSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

20 7 2 0 AC-YTSLIYSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

721 Ac -YTSLIHSLIEKSQNQQEKNEQELLELDKWASLWNWF-NH2 

722 AC-YTSLIHSSIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

723 AC-LEANISQLLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2 

724 AC-SLEECDSELEIKRYKNRVASRKCRAKFKQLLQHYR-NH2 

725 Ac -LEECDSELEIKRYKNRVASRKCRAKFKQLLQHYRE-NH2 
72 6 Ac - EE CD S E LE I KR YKNR VAS RKCRAKFKQLLQHYRE V - NH2 

727 AC-ECDSELEIKRYKNRVASRKCRAKFKQLLQHYREVA-NH2 

728 Ac - CDSELEIKRYKNRVASRKCRAKFKQLLQHYREVAA-NH2 

72 9 Ac-DSELEI KRYKNRVAS RKCRAKFKQLL QH YRE VAAA -NH2 

730 Des amino tyros ine -FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST -NH2 

731 WAS LWNW - NH2 

732 AC -EAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWG-NH2 

73 3 AC - IEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIW-NH2 
734 AC - AIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLG I -NH2 
73 5 Ac -RAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLG -NH2 
73 6 AC - LRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLL -NH2 

30 737 Ac -LLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQL -NH2 

73 8 AC-NLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQ-NH2 

7 3 9 Ac - QNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKD -NH2 
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740 


Ac- 


QQNNLLRAI EAQQHLLQLTVWGI KQLQARILAVERYLK-NH2 




741 


Ac- 


QQQNNTjLRAIEAQQHLLQLTVWGIKQLQARILAVERYL -NH2 




742 


Ac- 


VQQQNNLLRAI EAQQHLLQLTVWGIKQLQARI LAVERY- NH2 




743 


Ac- 


IVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVER-NH2 




744 


Ac- 


G IVQQQNNLLRAIEAQQHLLQLTVWG I KQLQARILAVE -NH2 




745 


Ac- 


• SGIVQQQNNLLRAIEAQQHLLQLTVWG IKQLQARILAV-NH2 


c 

5 


758 


Ac- 


RSMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTV-NH2 




760 


Ac- 


•GARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQL-NH2 




764 


Ac- 


• GSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQH-NH2 




765 


Ac- 


• GSTMGARSMTLTVQARQLLSG I VQQQNNLLRAIEAQQH -NH2 




766 


Ac- 


• EGSTMGARSMTLTVQARQLLSGI VQQQNNLLRAIEAQQ -NH2 




767 


Ac- 


• RAKFKQLLQHYREVAAAKSSENDRLRLL -NH2 




768 


Ac- 


■ AKFKQLLQHYREVAAAKS S ENDRLRLLL - NH2 




769 


Ac- 


* KFKQLLQHYREVAAAKS S ENDRLRLLLK-NH2 


10 


770 


Ac- 


- FKQLLQH YREVAAAKS S ENDRLRLLLKQ - NH2 




771 


Ac- 


-RAKFKQELQHYREVAAAKSSENDRLRLLLKQMCPS -NH2 




772 


DKWASLWNWF-NH2 




773 


Biotin-FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 




774 


Ac • 


vrp\ 7\ CTeM7*."T»T/TWrteT Tft TC-TOtrCTlTPT T TXK7TC TXT ^ ^ I*" C T* _ TTtT "5 

-xDASlSQ VNEKXNQSiiAr IKIlblJbliljllW VN AO ivb 1 




775 


Ac- 


- YDASISQVNEKINQSLAxIRKSDELIjI^ -Niiz 




776 


Ac* 


- FDAS ISQVNEKxNQSIiAYIKKSDELIiHNVNAG 




777 


Ac- 


- FDAS ISQVQEKIQQSIiAFIRKSDELLHQVQAGKST-NH2 


15 


778 


Ac- 


-FDASISQVNEKINQAIAFIRKADELLHNVNAGKST-NH2 




779 


AC- 


-FDASISQVNEKINQALAFIRKSDELLHNVNAGKST-NH2 




780 


Ac- 


-FDASISQVNEKINQSLAFIRKADELLHNVNAGKST-NH2 




781 


Ac- 


- YD AS I SQVQEE I QQALAF IRKADELLEQVQ AGKST - NH2 




782 


Ac- 


-FDASISQVNEKINQSLAFIRKSDELLENVNAGKST-NH2 




783 


Ac- 


- FDAS ISQVNEEINQSLAFIRKSDELLHNVNAGKST-NH2 




784 


AC« 


-VFPSDEFDASISQVNEKINQSLAFIRKSDELLENV-NH2 




785 


AC- 


-VFPSDEFDASISQVNEEINQSLAFIRKSDELLENV-NH2 


20 


786 


Ac- 


-VYPSDEYDASISQVNEEXNQALAYIRKABELLENV-NH2 


787 


AC 


- VFPSDEFDAS ISQVNEE INQS LAF IRKSDELLHNV -NH2 




788 


Ac 


-SNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQ-NH2 




789 


Ac 


-WSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEES-NH2 




790 


Ac 


- SWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEE-NH2 




791 


AC 


- AS WSNKS LEQ IWNNMTWMEWDRE INNYTSL IHS LIE -NH2 




792 


AC 


-NASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLI -NH2 




793 


AC 


-WNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSL-NH2 




793 


AC 


-WNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSL-NH2 


794 


AC 


- PWNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHS -NH2 




795 


AC 


-VPWNASWSNKSLEQIWNNMTWMEWDREINNYTSLIH-NH2 




796 


Ac 


-AVPWNASWSNKSLEQIWNNMTWMEWDREINNYTSLI -NH2 




797 


AC 


-tavp™aswsnksleqiwnnktwmewdreinnytsl-nh2 




798 


AC 


- TTAVP WNAS WS NKS LEQI WNNMTWMEWDRE INNYTS -NH2 




800 


AC 


-AAASDEFDAS ISQVNEKINQSLAF IRKSDELLHNV -NH2 




801 


Ac 


-VFPAAAFDASISQVNEKINQSLAFIRKSDELLHNV-NH2 


30 


802 


AC 


-VFPSDEAAASISQVNEKINQSLAFIRKSDELLHNV-NH2 


803. 


AC 


-VFPSDEFDAAAAQVNEKINQSLAF IRKSDELLHNV -NH2 




804 


AC 


-VFPSDEFDAS ISAAAEKINQSLAFIRKSDELLHNV-NH2 




805 


AC 


-VFPSDEFDAS ISQVNAAANQSLAFIRKSDELLHNV-NH2 
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8 06 AC-VPPSDEFDASISQVNEKIA7^ALAFIRKSDELLHNV-NH2 

B 07 AC-VFPSDEFDASISQVNEKINQSAAAIRKSDELLHNV-NH2 

808 AC-VFPSDEFDASISQVNEKINQSLAFAAASDELLHNV-NH2 

809 Ac -VFPSDEFDASISQTOEKINQSLAFIRKAAALLHNV-NH2 

810 AC-VFPSDEFDASISQVNEKINQSLAFIRKSDEAAANV-NH2 

811 Ac-VFPSDEFDASISQVNEKINQSIiAFIRKSDELLAAA-NH2 

812 AC-VYPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2 

813 Ac - AAAAIHSLIEESQNQQEKNEOELLELDKWASIjWNWF -NH2 

814 AC-YTSLIHSLIEESQQQQEKNEQELLELDKWASLWNWF-NH2 

815 Ac - YTSLIHSLIEESQNQQEKQEQELLELDKWASLWNWF-NH2 

816 AC-QIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKQ-NK2 

817 Ac-QIWNNMTWMEWDREINNYTSIiIHSLIEESQQQQEKN-NH2 

818 Ac - QIWNNMTWMEWDREINNYTSLIHSLIEESQQQQEKQ-NH2 

819 Ac -NKSIiEQIWNNMTWMEWDREINNYTSLIHSLIEESQQ - NH2 

820 Ac - FDASISQVNEKINQSLAFIEESDELLHNVNAGKST-NH2 

821 Ac - AC IRKSDELCL - NH2 

823 AC - YTSLIHSLIEESQNQQEKDEQELLELDKWASIiWNWF-NH2 

824 AC - YTSLIHSLIEESQDQQEKNEQELLELDKWASIiWNWF - NH2 

825 AC - YTSLIHSLIEESQDQQEKDEQELLELDKWASLWNWF -NH2 

826 Ac - YTSL IHS L IEESQNQQEKNEQELLELDKWASLWDWF -NH2 

841 Ac -LEANITQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2 

842 AC -LEANISASLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2 

843 AC -LEANISALLEQAQIQQEKNMYELQIOiNSWDVFTNWL-NH2 

844 AC -LEANITALLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2 

845 Ac - LEANITASIiEQAQIQQEKNMYELQKLNSWDVFTNWL -NH2 
84 5 Ac-LEANITASIiEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2 

846 AC - RAKFKQLLQHYREVAAAKS SENDRLRLLLKQMUPS - NH2 

847 AC - Abu - DDE - Abu - MNS VKNGT YD YPKYEEE S KLNRJNE IKG VKL - NH2 
856 AC-WQEWEQKVRYLEANISQSLEQAQIQQEKNMYELQKL-NH2 

860 AC -DEYDAS ISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2 

861 Ac - YTS L IHSL IEESQNQQEKNEQELLELDKWAS LWN - NH2 

862 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW-NH2 

863 - AC - YTSLIHSLIEESQNQQEKNEQELXjELDKWASL -NH2 

864 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWAS -NH2 

865 AC-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 

866 Ac -DREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

867 AC -NNMTWMEWDREINNYTSLIHSIiIEESQNQQEKNEQELLELDK-NH2 

868 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWAAA-NH2 

869 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWAAAANWF-NH2 

870 AC-YTSLIHSLIEESQNQQEKNEQELLELDAAASLWNWF-NH2 

871 Ac-YTSLIHSLIEESQNQQEKNEQELLAAAKWASLWlimF-NH2 

872 AC -YTSLIHSLIEESQNQQEKNEQAAAELDKWASLWNWF-NH2 

873 AC - YTSLIHSLIEES QNQQEKAAAELLELDKWASLWNWF -NH2 

874 Ac - YTSLIHSLIEESQNQAAANEQELLELDKWASLWNWF -NH2 

875 AC-YTSLIHSLIEESAAAQEKNEQELLELDKWASLWNWF-NH2 
87 6 AC - YTSLIHSLIAAAQNQQEKNEQELLELDKWASLWNWF -NH2 

877 AC-YTSLIHAAAEESQNQQEKNEQELLELDKWASLWNWF-NH2 

878 Ac - YTSAAASLIEESQNQQEKKEQELLELDKWASLV7NWF -NH2 

879 Ac - E IWNNMTWMEWDRENEKINQS LAF IRKSDELLHNV -NH2 

880 Ac -YISEVNEEINQSLAFIRKADELLENVBKWASLWNWF-NH2 
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881 AC -TSVITIELSNIKENKANGTDAKVKLIKQELDKYKN-NH2 

882 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFMG-NH2 

883 Ac -NEKINQSLAFIRKSDELLHNV-NH2 

884 B i o t in - YDPLVFPSDEFDAS ISQVNEKINQSLAFIRKSDEL -NH2 

885 Biotin-PLVFPSDEFDASISQVNEKINQSLAFIRKSDELLH-NH2 

886 B i o t in - VFPSDEFDAS IS QVNEKINQSLAF IRKSDELLHNV -NH2 
5 887 Biotin-DEFDASISQViraKINQSLAFIRKSDELLHNVNAGK-NH2 

888 Biotin-VyPSDEFDASISQVNEKINQSLAFIRKSDELLHNV--NH2 

889 Biotin-VYPSDEYDASISQVNEEINQALAYIRKADELLENV«NH2 

890 AC-VYPSDEFDASISQVQEEIQQALAFIRKADELLEQV-NH2 

891 Ac -NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

892 AC - NNYTS L IHS L IEES QNQQEKNEQELLELDKWAS LWNWF - NH2 

893 Ac - INNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

894 AC-EINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
10 895 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2 

896 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI -NH2 

897 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNIT-NH2 

898 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNITN-NH2 

899 AC-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK--NH2 

900 AC-NYTSL1HSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2 

901 Ac - NNYTSL IHSL I EE S QNQQE KNEQELLE LDKWAS LWNWFN I - NH2 

905 AC -KCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS IIPRTPD -NH2 

15 906 Ac -RAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS II PRTPD - NH2 

907 AC - VYPSDEYDAS IS QVNEEINQALAYIAAADELLENV-NH2 

909 AC - YD AS IS QVNEEINQALAYIRKADELL -NH2 

910 Ac-M-Nle-WMEWDREINNYTSLIHSLIEESQNQQEKNEQEIiLEL-NH2 

911 Ac - KNGT YD YPKYEEE S KLNRNE I KGVKLS S MG VYQ I - NH2 

912 AC-VTEKIQMASDNINDLIQSGVNTRLLTIQSHVQNYI-NH2 

913 QNQQEKNEQELLELDKWASLWNWF-NH2 

914 AC - QNQQEKNEQELLELDKWASLWNWF -NH2 
2Q 915 LWNWF-NH2 

916 ELLELDKWAS LWNWF - NH2 

917 EKNEQELLELDKWASLWNWF-NH2 

918 SL IEES QNQQEKNEQELLELDKWASIiWNWF -NH2 

919 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW 

920 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN 

921 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWASLW 

922 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASL 

923 TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

924 S LIHS LIEE S QNQQEKNEQELLELDKWAS LWNWF - NH2 

925 LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

926 IHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

94 0 AC -AAVALLPAVLLALLAPSELEIRRYKNRVASRKCRAKFKQ 

941 AC -AAVALLPAVLLALLAPCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCP-NH2 

942 AC - YTSLIHSLIEESQNQQEKNNNIERDWEMWTMNNWIQ-NH2 

944 VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

945 AC-LMQLARQLMQLARQMKQLADSLMQLARQVSRLESA-KH2 
30 946 AC-WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2 

947 Ac -MEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 

948 AC-EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 
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949 AC-MEWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2 

950 Biotin-W-Nle - EWDRE INNYTS L IHS L I E ES QNQ QE KNEQELLEL - NH2 

951 AC-YLEYDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2 

952 Ac - IKQFINMWQEVGKAMYA-NH2 

953 Ac - IRKS DELL -NH 2 

954 De canoy 1 - IRKSDELL -NH2 

955 Acetyl -Aca-Aca- IRKSDELL -NH2 

956 AC-YDASISQV-NH2 

957 Ac -NEKINQSL-NH2 

958 AC-SISQVNEEINQALAYIRKADELL-NH2 

959 Ac - QVNEE INQALAYIRKADELL -NH2 

960 Ac-EEINQALAYIRKADELL-NH 

961 AC - NQALAYIRKADELL - NH2 

962 Ac - LAYIRKADELL -NH2 

963 FDASISQVNEKINQALAFIRKSDELL-NH2 

964 Ac - W-Nle -EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL -NH2 

965 AC - AS RKCRAKFKQLLQH YRE VAAAKS S ENDRLRLLLKQMCPS LD VD S -NH2 

967 AC-WLEWDREIKNYTSLIHSLIEESQNQQEKNEQELLEL-NK2 

968 Ac - YVKGEPI INFYDPLVFPSDEFDASISQVNEKINQSL -NH2 

969 AC-VYPSDEYDASISQVNEEINQSLAYIRKADELLHNV-NH2 

970 Ac - YDAS ISQVNEEINQALAYIRKADELLENV-NH2 

971 AC - YDAS IS QVNEE INQALAYIRKADELLE - NH2 

972 AC-VYPSDEYDASISQVNEEINQALAYIRKAAELLHNV-NH2 

973 AC - VYPSDEYDAS ISQVNEEINQALAYIRKALELLHNV -NH2 

974 Decanoyl-YTSLIHSLIEESQWQQEKNEQELLELDKWASLWNWF-NH2 

975 Ac - VYPSDEYDAS ISQVNEEINQLLAYIRKLDELLENV-NH2 
97 6 AC -DEYDASISQVNEKINQSLAFIRKSDELL-NH2 

977 Ac -SNDQGSGYAADKESTQKAFDGITNKVNSVIEKTNT-NH2 

97 B AC -ESTQKAFDGITNKVNSVIEKTNTQFEAVGKEFGNLEKR-NH2 

979 Ac -DGITNKVNSVIEKTNTQFEAVGKEFGNLEKRLENLNK-NH2 

980 AC -DSNVKNLYDKVRSQLRDNVKELGNGAFEFYHK-NH2 

981 Ac - RDNVTCELGNGAFE F YHKADDEALNS VKNGTYDYPKY -NH2 

982 Ac - E F YHKADDEALNS VKNGTYDYPKY - NH2 

983 Ac - AAVALLPAVLLALLAPAADKESTQKAFDGITNKVNS -NH2 

984 Ac - AAVALLPAVLLALLAPAADSNVKNLYDKVRS QLRDN -NH2 

985 Ac - KE S TQKAFDG ITNKVNS V - NH2 

986 Ac - I EKTNTQFEAVGKEFGNLER - NH2 

987 AC - RLENLNKRVEDGFLDVWTYNAELLVALENE - NH2 

988 AC -SNVKNLYDKVRSQLRDN-NH2 

98 9 AC-WMEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2 

990 Ac - WMEWDREINNYTSLIHSLIEESQNQQEKNEQE -NH2 

991 AC -MEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2 

992 Ac -MEWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2 

993 Ac -EWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2 

994 AC - E WDREINNYTSLIHSLIEESQNQQEKNEQELL -NH2 

995 AC - EWDREINNYTSL IHSLIEES QNQQEKNEQEL - NH2 

996 Ac - YTKFIYTLLEESQNQQEKNEQELLELDKWASLWNWF-NH2 

997 Ac - YMKQLADSLMQLARQVSRLESA-NH2 

998 AC - YLMQLARQMKQLADSLMQLARQVSRLESA-NH2 

999 AC - YQEWERKVDFLEENITALLEEAQIQQEKNMYELQKL-NH2 
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1000 Ac - WMAWAAAINNYTSLIHSLIEESQNQQEKNEQBEEEE -NH2 

1001 Ac - YASLIAALIEESQNQQEKNEQELLBLAKWAALWAWF-NH2 

1002 [AC-EWDREINNYTSLIHSLIEESQNQQEKNEQEGGC-NH2] dimer 

1003 AC-YDISIELNKAKSDLEESKEWIKKSNQKLDSIGNWH-NH2 

1004 Biot iny 1 - IDIS IELNKAKSDLEESKEWIKKSNQKLDS IGNWH -NH2 

1005 Ac-YTSLI-OH 

1006 Fmoc-HSLIEE-OH 

1007 Fmoc -SQNQQEK-OH 

1008 Fmoc -NEQELLEL - OH 

1009 Fmoc-DKWASL-OH 

1010 Fmoc-WNWF-OH 

1011 AC - AKTLERTWDTLNHLLFIS SALYKLNLKSVAQITLS I -NE2 

1012 AC-NITLQAKIKQFINMWQEVGKAMYA-NH2 

1013 Ac - LENERTLDFHDSNVKNLYDKVRLQLRBN-NH2 

1014 Ac - LENERTLDFHDSNVKNLYDKVRLQLRDNVKELGNG -NH2 

1015 AC-TLDFHDSNVKNLYDKVRLQLRDNVKELGNGAFEF-NH2 

1016 Ac - IDIS IELNKAKSDLEESKEWIKKSNQKLDS IGNWH-NH2 

1021 Biot inyl - S ISQVNEEINQALAYIRKADELL -NH2 

1022 Bi ot inyl - S ISQVNEEINQSLAYIRKSDELL-NH2 

1023 Ac - S I SQVNEEINQS LAYIRKSDELL - NH2 

1024 Ac - ID IS IELNKAKSDLEES KEWIEKSNQELDS IGNWE -NH2 

1025 AC-IDISIELNKAKSDLEESKEWIKKSNQELDSIGNWH-NH2 

1026 Ac - IDIS IELNKAKSDLEEAKEWIDDANQKLDS IGNWH-NH2 

1027 Ac - IDI S IELNKAKSDLEES KEWIKKANQKLDS IGNWH -NH2 

1028 AC-IDISIELNKAKSDLEEAKEWIKKSNQKLDSIGNWH-NH2 

1029 Biotinyl-NSVALDPIDISIELNKAKSDLEESKEWIKKSNQKL-NH2 

1030 Biotinyl-ALDPIDISIELNKAKSDLEESKEWIKKSNQKLDSI-NH2 

1031 desArainoTyrosine-NSVALDPIDISIELNKAKSDLEESKEWIKKSNQKL-NH2 

1032 des AminoTyros ine - ALDPIDIS IELNKAKSDLEESKEWIKKSNQKLDS I -NH2 

1033 Ac - YDAS ISQVNEE INQALAFIRKADEL-NH2 

1034 Ac - YDAS I SQVNEE INQSLAY IRKADELL -NH2 

1035 B iot inyl - YDAS I SQVNEE INQALAYIRKADELL-NH2 

1036 B iot iny 1 - YDAS ISQVNEEINQSLAFIRKSDELL-NH2 

1037 AC - YDAS I SQVNEE INQSLAF IRKSDELL -NH2 

103 8 Ac - WLE WDRE INNYTS L IHS L I EES QNQQEKNEQEL - NH2 

1039 Biot inyl - ID I S IELNKAKSDLEES KEWIRRSNQKLDS IGNWH -NH2 

1044 AC-YESTQKAFDGITNKVNSVIEKTNTQFEAVGKEFGNLEKR-NH2 

1045 Biotin-DEYDASISQVNEKINQSLAFIRKSDELL-NH2 

1046 AC-MEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2 

1047 Ac -WQEWEQKVRYLEANISQSLEQAQIQQEKNMYEL-NH2 

1048 Ac - WQEWEQKVRYLEANISQSLEQAQIQQEKNEYEL-NH2 

1049 Ac - WQEWEQKVRYLEANITALLEQAQIQQEKNEYEL -NH2 

1050 Ac - WQEWEQKVRYLEANITALLEQAQIQQEKNMYEL-NH2 

1051 Ac - WQEWEQKVRYLEANISQSLEQAQIQQEKNEYELQKL-NH2 

1052 AC-WQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 

1053 Ac - WQEWEQKVRYLEANITALLEQAQIQQEKNMYELQKL -NH2 

1054 Ac - IDIS IELNKAKSDLEES KEWIEKSNQKLDS IGNWH -NH2 

1055 Ac - EFGNLEKRLENLNKRVEDGFLDVWTYNAELLVALENE -NH2 

1056 Ac -EDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYDKVRMQL-NH2 

1057 Ac - SIS QVNEKINQSLAFIRKSDELL -NH2 
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1058 desaminoTyr-SISQVNEKINQSLAFIRKSDELL-NH2 

1059 AC-SISQVNEKINQSLAYIRKSDELL-NH2 

1060 Ac- QQLLDVVKRQQEMLRLTWGTKNLQARVTAIEKYLKDQ -NH2 

1061 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFC 

1062 AC-FDASISQVNEKINQSLAYIRKSDELL-NH2 

1063 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWA 

1064 Indol e - 3 - ace ty 1 - DE FDAS I S QVNE KINQS LAF IRKSDELL -NH2 

1065 Indole -3 -acetyl -DEFDESISQVNEKINQSLAFIRKSDELL-NH2 

1066 Indole -3 -acetyl-DEFDESISQVNEKIEQSLAFIRKSDELL-NH2 

1067 Indole - 3 - acetyl -DEFDES I SQVNEKIEESLAF IRKSDELL -NH2 

1068 Indole - 3 -acetyl -DEFDES I SQVNEKIEESLQF IRKSDELL -NH2 

1069 indole - 3 - acetyl - GGGGGDEFDASISQVNEKINQSLAFIRKSDELL -NH2 

1070 2 - Nap thoyl-DEFDASISQVNEKINQS LAF IRKSDELL -NH2 

1071 de SNH2 Tyr-DEFDAS ISQVNEKINQSLAFIRKSDELL -NH2 

1072 biotin-ALDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI-NH2 

1073 Ac - YDASISQVNEKINQALAYIRKADELLHNVUAGKST -NH2 

1074 AC - VYPSDEYDAS I SQVNEKINQALAYIRKADELLHNV-NH2 

1075 Ac- VYPSDEYDAS ISQVNEKINQSLAYIRKSDELLKNV-NH2 

1076 Ac - WGWGYGYG - NH2 

1077 Ac - YGWGWGWGF -NH2 

1078 Ac - WQEWEQKVRYLEANITALQEQAQ IQAEKAE YELQKL - NH2 

1079 Ac -WQEWEQKVRYLEAEITALQEEAQ IQAEKAE YELQKL -NH2 

1081 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWAS 

1082 AC - VWPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2 

1083 AC - SKNISEQIDQIKKDEQKEGTGWGLGGKWWTSDWGV-NH2 

1084 AC - LSKNISEQIDQIKKDEQKEGTGWGLGGKWWTSDWG -NH2 

1085 Ac - DLS KNIS EQ IDQ I KKDEQKEGTGWGLGGKWWTSDW - NH2 

1086 Ac -EDLSKNISEQIDQIKKDEQKEGTGWGLGGKWWTSD -NH2 

1087 AC - IEDLSKNISEQIDQIKKDEQKEGTGWGLGGKWWTS -NH2 

1088 AC -GIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKWWT -NH2 
108 9 Ac - IGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKVW-NH2 

1090 2 -Napthoyl --PSDEFDASISQVNEKINQSLAFIRKSDELLHNVN-NH2 

1091 AC - VYPSDEYDAS ISQVNEKINQALAYIRKADELLENV-NH2 

1092 Ac - VYPSDEFDAS I SQVNEKINQALAF IRKADELLENV - NH2 
10 93 AC- VYPSDEYDAS ISQVNEKINQALAYIREADELLENV-NH2 

1094 Biotinyl-YDASISQVNEKINQSLAFIRESDELL-NH2 

1095 Ac -AIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKW-NH2 

1096 AC-AAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGK-NH2 

1097 AC-DAAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGG-NB2 

1098 Ac - PDAAIGIEDLSKNI SEQIDQIKKDEQKEGTGWGLG -NH2 

1099 AC -NITDKIDQIIHDFVDKTLPDQGDNDNWWTGWRQWI -NH2 

1100 AC-KNITDKIDQIIHDFVDKTLPDQGDNDNWWTGWRQW-NH2 

1101 AC - TKNITDKIDQI IHDFVDKTLPDQGDNDNWWTGWRQ - NH2 

1102 AC-WTKNITDKIDQIIHDFVDKTLPDQGDNDNWWTGWR-NH2 

1103 AC-DWTKNITDKIDQIIHDFVDKTLPDQGDNDNWWTGW-NH2 

1104 AC - HDWTKNITDKIDQI IHDFVDKTLPDQGDNDNWWTG -NH2 

1105 AC -PHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWWT-NH2 

1106 AC-EPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWW-NH2 

1107 AC - IEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNW-NH2 
110 8 AC-AIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDN-NH2 
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1109 Ac -AAIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDND-NH2 

1110 AC-DAAIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDN-NH2 

1111 AC-LSPTVWLSVIWMMWYWGPSLYSILSPFLPLLPIFF-NH2 

1112 AC - GLSPTVWLSVIWMMWYWGPSLYS ILSPFLPLLP IF -NH2 

1113 AC - VGLS PTVWLSVI WMMWYWGPSLYS ILSPFLPLLP I-NH2 

1114 AC - FVGLSPTWLS VI WMMWYWGPSLYS ILSPFLPLLP -NH2 
5 1115 AC-WFVGLSPTVWLSVIWMMWYWGPSLYSILSPFLPLL-NH2 

1116 AC - QWFVFLSPTVWLS VIWMMWYWGPSLYS ILSPFLPL -NH2 

1117 AC - VQWFVGLSPTVWLS VIWMMWYWGPSLYS I LS PFLP - NH2 

1118 Ac - FVQWFVGLS PTVWLSVI WMMWYWGPSLYS ILSPFL-NH2 

1119 AC - P FVQWFVGLS PTVWLSVIWMMWYWGPSLYSILSPF-NH2 

1120 AC - VPFVQWFVGLS PTVWLSVI WMMWYWGPSLYS ILS P - NH2 

1121 Ac - LVP FVQWFVGLS PTVWLS VIWMMWYWGPSLYS ILS -NH2 

1122 H - NHTTWMEWDRE INNYTSLIHSLIEE SQNQQEKNEQELLELDKW - OH 

10 1123 H- QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ -OH 

1124 AC-VYPSDEFDASISQVNEKINQSLAFIREADELLENV-NH2- 

1125 AC-VFPSDEFDASISQVNEKINQSLAYIREADELLENV-NH2 
112 6 Ac -DEFDASISQVNEKINQSLAYIREADELL-NH2 

1127 Ac -NEQELLELDKWASLWNWFGGGGDEFDAS ISQVNEKINQSLAFIRKSDELL -NH2 

112 8 Ac -LELDKWASLWNWFGGGGDEFDAS ISQVNEKINQSLAFIRKSDELL -NH2 

112 9 Naphthoyl -EGEGEGEGDEFDAS ISQVNEKINQSLAFIRKSDELL -NH2 
1130 Ac - ASRKCRAKFKQLLQHYREVAAAKS SENDRLRLLLKQMCPSLDV-NH2 

15 1131 Naphthoyl -GDEEDAS ISQVNEKINQSLAFIRKSDELL -NH2 

1132 Naphthoy 1 - GDEEDASESQVNEKINQSLAFIRKSDELL -NH2 

113 3 Naphthoyl-GDEEDASESQQNEKINQSLAFIRKSDELL-NH2 

1134 Naphthoyl -GDEEDASESQQNEKQNQSLAFIRKSDELL-NE2 

1135 Naphthoyl -GDEEDASESQQNEKQNQSEAFIRKSDELL-NH2 
113 6 AC - WGDEFDES ISQVNEKIEESLAFIRKSDELL -NH2 

1137 AC-YTSLGGDEFDESISQVNEKIEESLAFIRKSDELLGGWNWF-NH2 

1138 Ac-YTSLIHSLGGDEFDESISQVNEKIEESLAFIRKSDELLGGWASLWNWF-NH 

113 9 2 -Naphthoyl -GDEFDES ISQVNEKIEESLAFIRKSDELL -NH2 

114 0 2 -Naphthoyl -GDEEDESISQVNEKIEESLAFIRKSDELL-NH2 

1141 2 -Naphthoyl -GDEEDES I SQVQEKIEESLAFIRKSDELL-NH2 

1142 2 - Naphthoyl -GDEEDES I SQVQEKIEESLLFIRKSDELL-NH2 

1143 Biotin-GDEYDESISQVNEKIEESLAFIRKSDELL-NH2 

1144 2 -Naphthoyl -GDEYDES ISQVNEKIEESLAFIRKSDELL -NH2 

1145 AC-YTSLIHSLIDEQEKIEELAFIRKSDELLELDKWNWF-NH2 

1146 VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

1147 AC -NNLLRAIEAQQHLLQLTVWGSKQLQARILAVERYLKDQ-NH2 

1148 GGGVYPSDEYDAS ISQVNEEINQALAYIRKADELLENV-NH2 

1149 Ac -NNLLRAIEAQQHLLQLTVWGEKQLQARILAVERYLKDQ-NH2 

1150 Ac - PTRVNYIL I IGVLVLAbuEVTGVRADVHLL -NH2 

1151 Ac - PTRVNYILIIGVLVLAbuEVTGVRADVHLLEQPGNLW -NH2 

1152 Ac - PE KTPLLPTRVNYILI IGVLVLAbuEVTGVRADVHLL - NH2 

1153 AhaGGGVYPSDEYDAS I SQVNEE INQALAYIRKADELLENV - NH2 

1155 AC-YTSLIHSLGGDEFDESISQVNEKIEESLAFIRKSDELL-NH2 

1156 AC-YTSLGGDEFDESISQVNEKIEESLAFIRKSDELL-NH2 

30 1157 AC-DEFDESISQVNEKIEESLAFIRKSDELLGGWASLWNWF-NH2 

1158 Ac - DEFDESISQVNEKIEESLAFIRKSDELLGGWNWF-NH2 

1159 AC-YTSLIHSLIEESQNQQEKNEQELLELDKASLWNWF-NH2 
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1160 AC-YTSLIHSLIEESQNQQEKNEQELLELDKSLWNWF-NH2 

1161 AC-YTSLIHSLIEESQNQQEKNEQELLELDKLWNWF-NH2 

1162 AC - YTSLIHSL IEESQNQQEKNEQELLELDKWNWF - NH2 

1163 AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKASLWNWF-NH2 

1164 Ac -MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKSLWNWF-NH2 

1165 AC-MTTO1EWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKLWNWF-NH2 

1166 AC-OTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWNWF-NH2 

1167 AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-NH2 

1168 AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASL-NH2 

1169 (Pyr) HWSY (2-napthyl-D-Ala) LRPG-NH2 

1170 Ac -WNWFDEFDESISQVNEKIEESLAFIRKSDELLWNWF-NH2 

1171 AC-YTSLIHSLIEESQNQQEKNEQELLELDKYASLYNYF-NH2 

1172 AC -YTSLIHSLIEESQNQQEKNEQELLELDKYAYLYNYF-NH2 

1173 2 -Naphthoyl -AcaAcaAcaDEFDESISQVNEKIEESLAFIRKSDELLAcaAcaAcaW-NH2 

1174 2 -Naphthoyl -AcaAcaAcaGDEFDESISQVNEKIEESLAFIRKSDELLGAcaAcaAcaW-NH2 

1175 2 -Naphthoyl -GDEFDESISQVNEKIEESLAFIRESDELL-NH2 

1176 2 -Naphthoyl - GDEFDES ISQVNEKIEESLAFIEESDELL -NH2 

1177 Ac -WQEWEQKVNYLEANITALLEQAQIQQEKNEYELQKL-NH2 
117 8 Ac -WQEWEQKVDYLEANITALLEQAQIQQEKNEYELQKL-NH2 

1179 AC-WQEWEQKVRWLEANITALLEQAQIQQEKNEYELQKL-NH2 

1180 Ac - WQEWEKQVRYLEANITALLEQAQIQQEKNE YELQKL - NH2 

1181 Ac - WQEWEHQVRYLEANITALLEQAQ I QQEKNE YELQKL - NH2 

1182 AC-WQEWEHKVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 

1183 Ac - WQEWDRE VRYLEANITALLEQAQIQQEKNE YELQKL -NH2 

1184 AC - WQEWEREVRYLEANITALLEQAQIQQEKNEYELQKL -NH2 

1185 Ac - WQEWERQVRYLEANITALLEQAQIQQEKNEYELQKL -NH2 

1186 Ac - WQEWEQKVKYLEANITALLEQAQIQQEKNE YELQKL -NH2 

1187 Ac -WQEWEQKVRFLEANITALLEQAQI QQEKNE YELQKL -NH2 

1188 Ac - VNalPSDEYDAS I S QVNEE INQALAYIRKADELLENV -NH2 

1189 Ac - VNal PSDENalDAS ISQVNEEINQALAYIRKADELLENV-NH2 

1190 Ac -VNalPSDEYDAS I SQVNEEINQALANalIRKADELLENV-NH2 

1191 AC-VYPSDEFDASISQVNEKINQSLAFIREADELLFNFF-NH2 

1192 AC-VYPSDEYDASISQVNEEINQALAYIRKADELLFNFF-NH2 

1193 AC-YTSLITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 

1194 AC-YTSLITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1195 AC-YTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 

1196 Ac - YTSL ITALLEQAQIQQEKNE YELQELDEWASLWEWF -NH2 

1197 Ac -YTSLITALLEEAQIQQEKNE YELQELDEWASLWEWF -NH2 

1198 Naphthoyl - Aua - Aua - Aua -TALLEQAQIQQEKNEYELQKLAua - Aua - Aua - W -NH2 

1199 AC - WAAWEQKVRYLEANITALLEQAQIQQEKNEYELQKL - NH2 

1200 Ac - WQEAAQKVRYLEANITALLEQAQIQQEKNE YELQKL -NH2 

1201 Ac - WQEWAAKVRYLEANITALLEQAQIQQEKNE YELQKL -NH2 
12 02 AC -WQAAEQKVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 
12 03 Ac- WQE WEAAVRYLEAN I TALLEQAQ I QQEKNE YELQKL -NH2 
12 04 Ac - WQE WE QAARYLEANI TALLEQAQ I QQEKNE YELQKL -NH2 
12 05 AC -WQE WE QKAAYLEAN ITALLEQAQIQQEKNE YELQKL -NH2 
12 06 Ac -WQE WE QKVAALEAN I TALLEQAQ I QQEKNE YELQKL -NH2 

12 07 Ac -WQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKLGGGGWASLWNF-NH2 

12 08 2 -Naphthoyl -GDEFDASISQVNEKINQSIiAFIRKSDELT-NH2 

1209 2 -Naphthoyl -GDEFDASISQVNEKINQSLAFTRKSDELT-NH2 
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1210 2 -Naphthoyl -GDEFDAS I SQVNEKTNQSLAFTRKSDELT-NH2 

1211 2-Naphthoyl-GDEFDASISQTNEKTNQSLAFTRKSDELT-NH2 

1212 2 -Naphthoyl-GDEFDASTSQTNEKTNQSLAFTRKSDELT-NH2 

1213 2 -Naphthoyl -GDEYDASTSQTNEKTNQSLAFTRKSDELT-NH2 

12 14 2 -Naphthoyl -GDEFDEEISQVNEKIEESLAFIRKSDELL-NH2 

12 15 2 -Naphthoyl -GDEFDAS I SQVNEKINQSLAFIRKSDELA-NH2 

1216 2 -Naphthoyl- GDEFDAS ASQANEKANQSLAFARKSDELA-NH2 

1217 2 -Naphthoyl -GDEFDES I SQVNEKIEESLAFTRKSDELL-NH2 
1216 2 -Naphthoyl- GDEFDES ISQVNEKTEESLAFIRKSDELL-NH2 

1219 2 -Naphthoyl- GDEFDES ISQTNEKIEESLAFIRKSDELL-NH2 

1220 2 -Naphthoyl -GDEFDESTSQVNEKIEESLAFIRKSDELL-NH2 

1221 Ac - WNWFDEFDESTSQVNEKIEESLAFIRKSDELLWNWF-NH2 

1222 Ac - WNWFDEFDESTS QTNEKIEE S LAF IRKSDELL WNWF - NH2 

1223 Ac - WNWFDEFDESTSQTNEKTEESLAFIRKSDELLWNWF-NH2 

1224 Ac - LQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVAL-NH2 

1225 Ac - YTNLIYTLLEESQNQQEKNEQELLELDKWASLWSWF-NH2 

1226 AC -WQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 

122 7 AC -NNMTWQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKLDKWASLWNWF -NH2 

123 0 Ac - WNWFIEESDELLWNWF -NH2 

1231 2 -Naphthoyl -GFIEESDELLW-NH2 

1232 Ac -WFIEESDELLW-NH2 

123 3 2 -Naphthoyl -GFNFFIEESDELLFNFF-NH2 

1234 2 -Naphthoyl -GESDELW-NH2 

123 5 Ac - WNWFGDEFDES ISQVQEE IEESLAFIEESDELLGGWNWF -NH2 

1236 AC-WNWFIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

123 7 AC - YTSLITALLEQAQIQQEENEYELQALDEWASLWEWF-NH2 

1238 AC-YTSLIHSLGGDEFDESISQVNEEIEESLAFIEESDELLGGWASLWNWF-NH2 

123 9 2 -Naphthoyl -GDEFDESISQVQEEIEESLAFIEESDELL-NH2 

124 0 H - QARQLLSS IMQQQNNLLRAIEAQQHLLQLTVWGIKQLQARIIAVERYLKDQ- OH 

1241 Ac - CPKYVKQNTLKIATGMRNVPEKQTR-NH2 

1242 Ac - GLFGAIAGFIENGWEGMIDGWYGFRHQNSC -NH2 

1243 Ac - LNFLGGT -NH2 

1244 Ac - LDSWWTSLNFLGGT-NH2 

124 5 Ac - 1 LT I PQSLDS WWTS LNFLGGT -NH2 

124 6 Ac -GFFLLTRILT I PQSLDS WWTS LNFLGGT -NH2 

1247 AC - WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWNWF -NH2 

124 8 Ac -WNWFITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 

124 9 AC - WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF -NH2 

1250 Ac - WQEWEQKVRYLEANITALLEQAQIQQEKIEYELQKL-NH2 

1251 Ac - WQEWEQKVRYLEAQITALLEQAQIQQEKIEYELQKL -NH2 

1252 AC - KENKANGTDAKVKLIKQELDKYKNAVTELQLLMQS -NH2 

1253 Ac -NI KENKANGTDAKVKLIKQELDKYKNAVTELQLLM - NH2 

1254 (FS) -YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1255 2 -Naphthoyl -GWNWFAcaDEFDESISQVQEEIEESLAFIEESDELLAcaWNWF-NH2 

1256 AC-WNWFGDEFDESISQVNEKIEESLAFIEESDELLGWNWF-NH2 

1257 Ac -WNWFGDEFDESISQVNEKIEESLAFIRKSDELLGWNWF-NH2 

1258 Ac - WNWF-Aca -DEFDESISQVNEKIEESLAFIRKSDELL-Aca -WNWF-NH2 

1259 AC - WNWF - Aca -DEFDES ISQVNEKIEESLAFIEESDELL - Aca - WNWF -NH2 

1260 Ac -EESQNQQEKNEQELLELDKWA-NH2 

1261 EESQNQQEKNEQELLELDKWA 
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12 62 Ac - CGTTDRSGAPTYS WGANDTDVFVLNNTRP PLGNWFG - NH2 

12 63 AC -GVEHRLEAACNWTRGERADLEDRDRSELSP -NH2 

12 64 AC -CVREGNASRAOTAVTPTVATRDGKLPT-NH2 

1265 Ac - CFS PRHHWTTQDANAS IYPG-NH2 

1266 AC - LQHYRE VAAAKS S ENDRLRLLLKQMC P SLDVDS -NH2 

1267 AC-WQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 

1268 AC-CWQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWFC-NH2 

1269 AC-WQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWEWF-NH2 

1270 AC-CWQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWEWFC-NH2 

1271 AC-GQNSQSPTSNHSPTSAPPTAPGYRWA-NH2 

1272 AC-PGSSTTSTGPARTALTTAQGTSLYPSA-NH2 

1273 AC-PGSSTTSTGPARTALTTAQGTSLYPSAAATKPSDGNATA-NH2 

1275 Ac - WQE WDREITALLEQAQ I QQEKNE YELQKLDKWAS LWNWF - NH2 

1276 AC -WQEWDREITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 

1277 AC - WQEWDRE ITALLEQAQIQQEKNE YELQKLDEWEWF - NH2 

1278 AC - WQEWDREITALLEQAQIQQEKNEYELQKLDEWEWF -NH2 

1279 AC - WQEWERE ITALLEQAQIQQEKNE YELQKL IEWEWF -NH2 

1280 AC - WQEWERE ITALLEQAQ IQQEKI E YELQKLDEWEWF -NH2 

1281 AC - WQEWE ITALLEQAQIQQEKNEYELQKLDEWEWF -NH2 

1282 Ac - WQEWE ITALLEQAQ I QQEKNE YELQKLI EWE WF -NH2 

1283 AC - WQEWEITALLEQAQIQQEKIE YELQKLDEWEWF -NH2 
12 84 AC -WQEWE ITALLEQAQ I QQEKIEYELQKLIEWEWF-NH2 

1285 AC - WQEWDRE IDEYDAS ISQVNEKINQALAYIREADELWEWF -NH2 

1286 Ac - WQEWERE IDEYDAS I S QVNEKINQ ALAYIREADELWEWF -NH2 

1287 Ac - WQEWE IDEYDAS I SQVNEKINQALAYIREADELWEWF -NH2 
12 88 Ac - WQEWDRE IDEYDAS I SQVNEE INQALAYIREADELWEWF -NH2 
1289 AC -WQEWERE IDEYDAS I SQVNEE INQALAYIREADELWEWF -NH2 
12 90 AC - WQEWE IDEYDAS ISQVNEEINQALAYIREADELWEWF-NH2 
1291 Ac - WQEWDEYDASISQVNEKINQALAYIREADELWEWF-NH2 

12 92 AC - WQEWDE YDAS ISQVNEE INQALAYIREADELWEWF -NH2 

12 93 AC-WQEWEQKITALLEQAQIQQEKIEYELQKLIEWEWF-NH2 

1294 Ac - WQEWEQKITALLEQAQIQQEKIE YELQKL IEWASLWEWF - NH2 

1295 Ac -WQEWE ITALLEQAQ I QQEKIE YELQKL IEWASLWEWF -NH2 
12 98 -VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

12 99 AC-WVYPSDEYDASISQVNEEINQALAYIRKADELLENVWNWF-NH2 

13 00 YTSLIHSLIEESQNQQEKNEQELLELDKWAS LWNWF -NH2 
13 01 Ac - WQEWDE YDAS IS QVNEKINQALAYIREADELWAWF -NH2 
13 02 AC - WQAWDEYDAS IS QVNEKINQALAYIREADELWAWF -NH2 

1303 Ac - WQAWDEYDAS I S QVNEKINQALAYIREADELWEWF -NH2 

1304 Biotin-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDEL-NH2 

1305 Biotiil- YDPLVFPSDEFDAS ISQVNEKINQSLAF -NH2 
13 06 Biotin-QVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 
13 07 AC - WMEWDRE I - NH2 

1308 AC - WQEWEQKI - NH2 

1309 Ac - WQEWEQKITALLEQAQIQQEKIEYELQKLIKWASLWEWF -NH2 

1310 Ac - WQE WEQ KI TALLEQAQ I QQE KIE YE LQKL I EWASLWE WF - NH2 

1311 Ac - WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKLIEWEWF -NH2 

13 12 Ac - WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKEWEWF-NH2 

1313 AC - WQEWERE I S AYTS L ITALLEQAQ I QQEKIE YELQKE WE W - NH2 

13 14 Ac - WQEWERE I S AYTS L I TALLEQAQ I QQEKIE YELQKL I EWE W - NH2 
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1315 AC - FNLSDHSES IQKKFQLMKKHVNKIGVDSDP IGS WLR - NH2 

1316 AC-DHSESIQKKFQLMKKHVNKIGVDSDPIGSWLRGIF-NH2 

1317 AC - WS VKQANLTTS LLGDLLDD VTS IRHAVLQNRA -NH2 

1318 Biotin-WMEWDREI-NH2 

1319 Biotin-NNMTWMEWDREINNYTSL-NH2 

1320 Ac -GAASLTLTVQARQLLSGIVQQQNNLLRAIEAQQHLL-NH2 

1321 Ac -ASLTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQL-NH2 

1322 Ac - VS VGNTLYYVNKQEGKSLYVKGEPI INFYDPLVF -NH2 

1323 AC - QHWS YGLRPG - NH2 

1324 AC - WQEWEQKIQHWS YGLRPGWASLWEWF -NH2 

1325 AC-WQEWEQKIQHWSYGLRPGWEWF-NH2 

1326 AC - WNWFQHWS YGLRPGWNWF - NH2 

1327 Ac - FNFFQHWSYGLRPGFNFF-NH2 

1328 Ac -GAGAQHWS YGLRPGAGAG -NH2 

132 9 PLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGT 

133 0 AC-WQEWEQKITALLEQAQIQQEKIEYELQKLAKWASLWEWF-NH2 
13 31 Ac - WQEWEQKITALLEQAQIQQEKIEYELQKLAEWASLWEWF-NH2 

1332 Ac - WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWEWF -NH2 

1333 AC -WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWAWF-NH2 

1334 AC - WQEWEQKITALLEQAQIQQEKAEYELQKLAKWASLWAWF-NH2 

1335 AC -TNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNK-NH2 
133 6 AC - KAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNKQS -NH2 
133 7 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLIEWEWF -NH2 
133 8 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLIEWEWF -NH2 

133 9 Ac - WQEWEQKITALLEQAQIQQEKIEYELQKLDKWEWF -NH2 

1340 Ac - YDPLVFPSDEFDASISQVNEKINQSLAF-NH2 

1341 Fluor--VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

1342 Fluor - YTSLIHSLIEES QNQQEKNEQELLELDKWASLWNWF -NH2 

1344 Ac - SGIVQQQNNLLRAIEAQQHLLQLTVWG IKQLQARIL -NH2 

1345 AC - QQQNNLLRAIEAQQHLLQLTVWG IKQLQARILAVERYLKDQ -NH2 

1346 AC - SGIVQQQNNLLRAIEAQQHLLQLTWGIKQLQARILAVERYLKDQ-NH2 

1347 Ac - WQE WEQKITALLEQAQ I QQEKNE YELQKLAE WAS LWAWF - NH2 

134 8 Ac - WQE WEQKITALLEQAQ I QQEKNE YELQKLAE WAS L WAW - NH2 
134 9 Ac - WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWAW- NH2 
13 5 0 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLAEWAGLWAWF -NH2 
13 51 Ac -WQE WEQKITALLEQAQ IQQEKNEYELQKLAEWAGLWAW-NH2 
1352 Ac -WQEWEQKITALLEQAQIQQEKAEYELQKLAEWAGLWAW-NH2 
13 53 AC - WQE WEQKITALLEQAQIQQEKNEYELQKLDKWAGLWEWF -NH2 

1354 Ac - WQEWQHWS YGLRPGWE WF - NH2 

1355 AC - WQAWQHWS YGLRPGWAWF - NH2 

13 5 6 Biot inyl -WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1357 WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF 

1358 WQEWEQKITALLEQAQIQQEKIEYELQKLIEWEWF 

13 61 AC - AGSTMGARSMTLTVQARQLLSG IVQQQNNLLRAIEAQQ - NH2 

1362 AC - AGSAMGAASLTLSAQSRTLLAG IVQQQQQLLD WKRQQ -NH2 

1363 AC-AGSAMGAASTALTAQSRTLLAGIVQQQQQLLDWKRQQ-NH2 
13 64 Ac -ALTAQSRTLLAGIVQQQQQLLDWKRQQELLRLTVWGT-NH2 
13 65 Ac - TLS AQSRTLLAGI VQQQQQLLDWKRQQEMLRLTVWGT- NH2 

1366 AC - TLTVQARQLLSGI VQQQNNLLRAIEAQQHLLQLTVWG I - NH2 

1367 AC - WQAWIE YEAELSQVKEKIEQSLAYIREADELWAWF -NH2 
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1368 
1369 
1370 
1371 
1372 
1373 
1374 
1375 
1376 
1377 
1378 
1379 
1380 
1381 
1382 
1383 
1384 
1385 
1386 
1387 
1388 
1389 
1390 
1391 
1392 
1393 
1394 
1395 
1396 
1397 
1398 
1399 
1400 
1402 
1403 
1404 
1405 
1406 
1407 
1408 
1409 
1410 
1411 
1412 
1413 
1414 
1415 
1416 
1417 
1418 



Ac -WQAWIEYEASLSQAKEKIEESKAYIREADELWAWF -NH2 — — - 

AC - WQAWIEYERLLVQAKLKIAIAKLY IAKELLEWAWF -NH2 
Ac - WQAWIE YERLLVQVKLKIAIALLYIAKELLEWAWF -NH2 
Ac -WQAWIELERLLVQVKLKLAIAKLE IAKELLEWAWF -NH2 
Ac - GE WTYDDATKTFTVTEGGH -NH2 

AC - WQEWEQKIGEWTYDDATKTFTVTEGGHWASLWEWF -NH2 
Ac - GEWTYDDATKTFTVTE -NH2 

AC - WQEWEQKI GEWTYDDATKTFTVTE WAS LWEWF - NH2 
Ac -MHRFDYRT-NH2 

Ac - WQE WEQKIMHRFDYRTWAS LWEWF - NH2 
AC - MHRFNWSTGGG -NH2 

AC - WQEWEQKIMHRFNWSTGGGWASLWEWF -NH2 
Ac - MHRFNWS T -NH2 

Ac - WQEWEQKIMHRFNWSTWASLWEWF -NH2 
Ac - LLVPLARIMTMSS VHGGG -NH2 

AC-WQEWEQKILLVPLARIMTMSSVHGGGWASLWEWF-NH2 ' 
AC - LLVPLARIMTMSS VH - NH2 

Ac - WQEWEQKI LLVPLARIMTMSSVHWASLWEWF-NH2 

TALLEQAQIQQEKNEYELQKLDK 

Ac -TALLEQAQIQQEKNEYELQKLDK-NH2 

Ac -TALLEQAQIQQEKIEYELQKLIE-NH2 

TALLEQAQIQQEKIEYELQKLIE 

AC-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERY-NH2 

Rhod - QARQLLS G I VQQQNNLLRAI E AQQHLLQLTVWG I KQLQARI LAVER¥ - NH2 

Ac - GAASLTLSAQSRTLLAGIVQQQQQLLDWKRQQEML -NH2 

Ac - GSAMGAASLTLSAQSRTLLAGIVQQQQQLLDWKRQQEML -NH2 

AC - PALSTGLIHLHQNIVDVQFLFGVGS S IASWAIKWEY-2JH2 

Ac - PALSTGLIHLHQNIVDVQFLYGVGS S IASWAIK-NH2 

AC-LSTTQWQVLPUSFTTLPALSTGLIHLHQNIVDVQY-NH2 

Ac - FRKFPEATFSRUGSGPRITPRUMVDFPFRLWHY-NH2 

AC -DFPFRLWHFPUTINYTIFKVRLFVGGVEHRLEAAUNWTR-NH2 cT 

Ac - YVGGVEHRLEAAUNWTRGERUDLEDRDRS ELS PL-NH2 

MVYPSDEYDASISQVNEEINQALAYIRKADELLENV 

AC-GPLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGG-NH2 

AC-LGPLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLG-NH2 

Ac - FLGPLLVLQAGFFLLTRILTIPQSLDSWWTSLNFL -NH2 

Ac - YTNTI YTLLEESQNQQEKNEQELLELDKWASLWNWF -NH2 

YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF 

Ac - YTGIIYNLLEESQNQQEKNEQELLELDKWANLWNWF -NH2 

YTGI I YNLLEESQNQQEKNEQELLELDKWANLWNWF 

AC-YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF-NH2 

YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF 

Ac - EKS Q IQQEKNEQELLELDKWA - NH2 

EKSQIQQEKNTEQELLELDKWA 

AC -EQAQIQQEKNEYELQKLDKWA-NH2 

AC-YTSLIGSLIEESQIQQERNEQELLELDRWASLWEWF-NH2 

AC-YTXLIHSLIXESQNQQXKNEQELXELDKWASLWNWF-NH2 

AC - YTXLIHSLIWESQNQQXKNEQELXELD -NH2 

AC-YTSLIHSLIEESQNQQEKNEQELLELD-NH2 

Ac - WQEQEXKITALLXQAQIQQXKNE YE LXKLDKWAS LWEWF -NH2 
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1419 AC - XKITALLXQAQIQQXKNEYELXKLDKWASLWEWF -NH2 

142 0 AC - WQE WWXKITALLXQAQ I QQXKNE YELXKLD -NH2 

1421 Ac - WEQKITALLEQAQ I QQE KNE YELQKLD -NH2 

1422 Ac -WEXKITALLXQAQIQQXKNE YELXKLD -NH2 

1423 Ac - XKITALLXQAQ IQQXKNE YELXKLD -NH2 

1425 Ac - QKITALLEQAQIQQEKNE YELQKLD -NH2 

1426 Ac- QKITALLEQAQIQQEKNE YELQKLDKWASLWEWF-NH2 

1427 Ac - WQEWEQKITALLEQAQIQQEKNE YELQKLD -NH2 

1428 AC -VYPSDEYDAS ISQVNEE INQALAYIRKADELLEN- OH 

1429 Ac - VYPSDEYDAS ISQVNEE INQALAYIRKADELLE -OH 

1430 AC - VYPSDEYDAS I SQVNEE INQALAYIRKADELL - OH 

1431 Ac-VYPSDEYDASISQVNEEINQALAYIRKADEL-OH 

1432 YPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 
14 33 PSDEYDAS I SQVNEE INQALAYIRKADELLENV-NH2 

1434 SDEYDAS ISQVNEE INQALAYIRKADELLENV-NH2 

1435 DEYDAS ISQVNEE INQALAYIRKADELLENV-NH2 

1436 Ac - VYP SDEYDAS I S QVDEE INQALAYIRKADELLENV -NH2 

1437 Ac - VYPSDEYDAS I S QVNEE IDQALA YIRKADELLENV - NH2 

1438 Ac - VYPSDEYDAS IS QVNEE INQALAYIRKADELLEDV-NH2 

1439 Ac - VYPSDEYDAS ISQVDEEIDQALAYIRKADELLENV-NH2 

1440 Ac -LLSTNKAWSLSNGVSVLTSKVLDLKNYIDKQLLP -NH2 

1441 AC-LSTNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPI-NH2 

1442 Ac - STNKAWSLSNGVSVGTSKVLDLKNYIDKQLLPIV-NH2 

1443 AC-TNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVN-NH2 

1444 AC-NKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNK-NH2 

1445 AC - KAWSLSNGVS VLTSKVLDLKNYIDKQLLP IVNKQ -NH2 

1446 Ac -AWSLSNGVS VLTSKVLDLKNYIDKQLLP IVNKQS -NH2 

1447 AC- WSLSNGVSVLTSKVDLKNYIDKQWLLPIVNKQSU-NH2 
144 8 Ac -VSLSNGVS VLTSKVLDLKNYIDKQLLP I VNKQSUS -NH2 
144 9 Ac - SLSNGVSVLTSKVLDLKNYIDKQLLPIVNKQSUS I -NH2 

1450 Ac - LSNGVSVLTSKVLDKLKNYIDKQLLP I VNKQSUS IS -NH2 

1451 AC - SNGVSVLTSKVLDLKNYIDKQLLP I VNKQSUS ISN-NH2 

1452 Ac - NGVS VLTS KVLDLKNYIDKQLLPI VNKQSUS I SNI - NH2 

1453 * Ac - GVSVLTSKVLDLKNYIDKQLLPIVNKQSUSISNIE -NH2 

1454 AC -VS VLTSKVLDLKNYIDKQLLP I VNKQSUS IS INI ET -NH2 

1455 AC - S VLTS KVLDLKNYIDKQLLP I VNKQSUS I SNIETV - NH2 

1456 Ac - VLTSKVLDLKNYIDKQLLPIVNKQSUS ISNIETVI -NH2 

1457 Ac - LTS KVLDLKNYIDKQLLP IVNKQS US I SNIETVIE -NH2 

1458 Ac - TS KVLDLKNYIDKQLLP I VKQSUS I SNIETVIEF -NH2 

1459 Ac - S KVLDLKNYIDKQLLPIVNKQSUS ISNIETVIEFQ -NH2 

1460 Ac -KVLDLKNYIDKQLLP I VNKQSUS I SNIETVIEFQQ-NH2 

1461 AC - VLDLKNYIDKQLLPIVNKQSUS ISNIETVIEFQQK-NH2 

1462 AC - LDLKNYIDKQLLPIVNKQSUS ISNIETVIEFQQKN -NH2 

1463 Ac -DLKNYIDKQLLPIVNKQSUS ISNIETVIEFQQKNN-NH2 

1464 Ac - LKNYIDKQLLPIVNKQSUS ISNIETVIEFQQKNNR -NH2 

1465 AC - KNYIDKQLLPIVNKQSUSISNIETVIEFQQKNNRL -NH2 

1466 AC-NYIDKQLLPIVNKQSUSISNIETVIEFQQKNNRLL-NH2 
14 67 Ac - YIDKQLLPIVNKQSUSISNIETVIEFQQKNNRLLE -NH2 
1468 AC - IDKQLLPIVNKQSUSISNIETVIEFQQKNNRLLEI -NH2 
14 69 Ac -DKQLLP I VNKQSUS ISNIETVIEFQQKNNRLLEIT-NH2 
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1470 Ac - KQLLPIVNKQSUS ISNIETVIEFQQKNNRLLEITR-NH2 

1471 Ac -QLLPIVNKQSUSISNIETVIEFQQKNNRLLEITRE-NH2 

147 2 Ac - VYP SDEYDAS I S QVNEE INQ ALA 

1473 QVNEE INQALAYIRKADELLENV - NH2 

1474 VYPSDEYDAS ISQVNEEINQALAYIRKADELLENV 

1475 AC - DEYDAS ISQVNEEINQALAYIREADEL-NH2 

1476 Ac -DEYDAS I SQVNEKINQALAYIREADEL-NH2 

1477 Ac -DDECLNSVKNGTYDFPKFEEESKLNRNEIKGVKLS -NH2 

1478 Ac-DDE-Abu-LNSVKNGTYDFPKFEEESKLNRNEIKGVKLS-NH2 

1479 Ac - YHKCDDE CLNS VKNGTFDFPKFEEES KLHRNE IKGVKLS S - NH2 

148 0 Ac - YHK- Abu - DDE - Abu - LNS VKNGTFDFPKFEEES KLNRNE I KGVKLS S -NH2 
14 81 Ac - YTS L IHS L IEES QI QQEKNEQELLELDKWAS LWNWF - NH2 

1482 Ac - YTSLIHSLIEESQNQQEKNEYELLELDKWASLWNWF-NH2 

14 83 Ac - YTSLIHSLIEESQIQQEKNEYELLELDKWASLWNWF-NH2 

14 84 AC-YTSLIHSLIEESQIQQEKNEYELQKLDKWASLWNWF-NH2 

14 85 Ac - YTSLIHSLIEESQNQQEKNEQELQKLDKWASLWNWF -NH2 

14 86 Ac - YTSLIHSLIEESQNQQEKNE YELQKLDKWASLWNWF -NH2 

1487 Ac - YTS Ij IHS L IEESQI QQEKNEQELQKLDKWASLWNWF - NH2 

14 8 8 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWEWF-NH2 

14 89 AC - YTSLIHSLIEESQIQQEKNEQELLELDKWASLWEWF -NH2 

1490 Ac - YTSL IHSLIEESQNQQEKNEYELLELDKWASLWEWF -NH2 

1491 AC - YTSLIHSLIEESQIQQEKNEYELLELDKWASLWEWF -NH2 

1492 AC-YTSLIHSLIEESQIQQEKNEYELQKLDKWASLWEWF-NH2 

1493 AC - YTSLIHSLIEESQNQQEKNEQELQKLDKWASLWEWF -NH2 

1494 AC - YTSLIHSLIEESQNQQEKNEYELQKLDKWASLWEWF-NH2 

1495 AC - YTSLIHSLIEESQIQQEKNEQELQKLDKWASLWEWF -NH2 

1496 Ac -WQEQEQKITALLEQAQIQQEKNEYELQKLDKEWWF-NH2 

1497 AC-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWASLWEWF-NH2 

14 98 Ac- WQEWEQKITALLEQAQIQQEKNEYELQKLAKWASLWEWF-HH2 

1499 AC-WQEWEQKITALLEQAQIQQEKNEYELQKLIKWASLWEWF-NH2 

1500 Ac - WQEWEQKITALLEQAQ IQQEKNE YELQKL IEWAGLWEWF - NH2 

1501 Ac -WQEWEQKITALLEQAQIQQEKNEYELQKIAKWAGLWEWF-NH2 

1502 Ac -WQEWEQKITALLEQAQ IQQEKNE YELQKL I KWAGLWEWF-NH2 

1503 AC -WQEWEQKITALLEQAQ IQQEKNE YELQKL IEWAGLWAWF-NH2 

1504 AC - WQEWEQKITALLEQAQIQQEKNE YELQKLAKWAGLWAWF -NH2 

1505 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLIKWAGLWAWF -NH2 
150 6 AC-WQEWEQKITALLEQAQIQQEKGEYELQKLDKQEQF-NH2 
1507 AC - WQEWEQKITALLEQAQ IQQEKGE YELLELDKWE WF - NH2 

15 08 AC - WQEWEQKITALLEQAQ IQQEKGE YELQKLAKWEWF - NH2 

1509 AC - WQEWEQKITALLEQAQIQQEKGE YELQKLDWQWEF - NH2 

1510 Ac - WQEWEQKITALLEQAQIQQEKGE YELLELAKWEWF - NH2 

1511 AC - WEQWEQKITALLEQAQIQQEKNE YELLELDKWEWF -NH2 

1512 AC-WQEWEQKITALLEQAQIQQEKNEYELEEELIEWASLWEWF-NH2 

1513 AC - WQEWEQKITALLEQAQIQQEKNE YELLELIEWAGLWEWF -NH2 

1514 AC - WQEWEQKITALLEQAQIQQEKNE YELLELIEWAGLWAWF-NH2 

1515 AC - WQEWERE ITALLEQAQIQQEKNEYELQKLIEWASLWEWF -NH2 

1516 AC - WQEWERE I QQEKNE YELQKLDKWASLWEWF -NH2 

1517 AC - WQEWERE IQQEKGE YELQKL IEWEWF-NH2 

1518 AC - WQEWQAQ IQQEKNEYELQKLDKWASLWEWF -NH2 

1519 AC - WQEWQAQIQQEKGEYELQKLIEWEWF-NH2 
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1520 PEG - GWQEWEQRITALLEQAQI QQERNEYELQRLDEWASLWEWF -NH2 

1521 A.C-GWQEWEQRITALLEQAQIQQERNEYELQRLDEWASLWEWF-NH2 

1522 PEG - YTSLITALLEQAQ IQQERNEQELLELDEWASLWEWF -NH2 

1523 Ac - YTSLITALLEQAQIQQERNEQELLELDEWASLWEWF -NH2 

1526 PEG - GWQEWEQRITALLEQAQIQQERNEYELQELDEWASLWEWF -NH2 

1527 AC - GWQEWEQRITALLEQAQIQQERNEYELQELDEWASLWEWF - NH2 
5 1528 PEG-YTSLIGSLIEESQIQQERNEQELLELDRWASLWEWF-NH2 

1529 PEG - GWQEWEQRITALLEQAQIQQERNEYELQRLDRWAS LWEWF -NH2 

1530 Ac -GWQEWEQRITALLEQAQIQQERNEYELQRLDRWASLWEWF-NH2 

1531 PEG - GWQEWEQRITALLEQAQ IQQERNEYELQELDRWAS LWEWF - NH2 

1532 Ac -GWQEWEQRITALLEQAQIQQERNEYELQELDRWASLWEWF -NH2 
153 3 PEG-YTSLIGSLIEESQNQQERNEQELLELDRWASLWNWF-NH2 
1534 Ac - YTSLIGSLIEESQNQQERNEQELLELDRWASLWNWF-NH2 

153 8 Ac - YTSLIHSLIEESQNQQEK-OH 
10 153 9 NEQELLELDK 

154 0 WAS LWNWF -NH2 

1542 Ac -AAAWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

154 3 Ac - WQEAAAKITALLEQAQ IQQEKNE YELQKLDKWAS LWEWF - NH2 

1544 AC-WQEWEQAAAALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1545 Ac -WQEWEQKITAAAEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1546 Ac -WQEWEQKITALLAAAQ IQQEKNE YELQKLDKWAS LWEWF -NH2 

1547 Ac - WQEWEQKITALLEQAAAAQEKNEYELQKLDKWAS LWEWF - NH2 
15 1548 Ac -WQEWEQKITALLEQAQIQAAANEYELQKLDKWAS LWEWF -NH2 

1549 Ac - WQE WEQKITALLEQAQI QQEKAAAELQKLDKWAS LWEWF - NH2 

1550 AC - WQEWEQKITALLEQAQIQQEKNEYAAAKLDKWASLWEWF-NH2 

1551 AC -WQEWEQKITALLEQAQ IQQEKNE YELQAAAKWAS LWEWF -NH2 

1552 AC - WQEWEQKITALLEQAQ IQQEKNE YELQKLDAAAS LWEWF -NH 

1553 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLDKWAAAAEWF-NH 

1554 Ac - WQE WEQKI TALLEQAQ I QQEKNE YELQKLDKWAS LWAAA - NH 

1556 AC -YTSLIHSLIEESQNQQEKNEQELLLDKWAS LWNWF -NH2 

1557 Ac -YTSLIHSLIEESQNQEKNEQELLELDKWAS LWNWF -NH2 

1558 Ac - ERTLDFHD S -NH2 

1559 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN (W) F-NH2 

1563 Ac - YTSLIHSLIEESQN (Q) QEKNEQELLELDKWASLWNWF-NH2 

1564 Ac -YTSLIHSLIEESQNQQDKWASLWNWF-NH2 

1566 Ac - F YE I IMD IEQNNVQGKKGIQQLQKWEDWVGWIGNI -NH2 

1567 Ac - INQT IWNHGNITLGEWYNQTKDLQQKFYEI IMDIE -NH2 

1568 AC - WNHGNITLGEWYNQTKDLQQKF YE I IMD IEQNNVQ -NH2 

1572 Ac - YTS L IHS L IEES ENQQEKNEQELLELDKWAS LWNWF - NH2 

1573 AC-YTSLIHSLIEESQDQQEKNEQELLELDKWASLWNWF-NH2 

1574 AC-YTSLIHSLIEESQNEQEKNEQELLELDKWASLWNWF-NH2 

1575 C-YTSLIHSLIEESQNQEEKNEQELLELDKWAS LWNWF -NH2 

1576 AC-YTSLIHSLIEESQNQQEKDEQELLELDKWASLWNWF-NH2 

1577 AC - LGEWYNQTKDLQQKFYEI IMDIEQNNVQGKKGIQQ-NH2 

1578 AC - WYNQTKDLQQKF YE I IMD IEQNNVQGKKGIQQLQK-NH2 

1579 AC-YTSLIHSLIEESQNQQEKNEEELLELDKWASLWNWF-NH2 

1580 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWDWF-NH2 
3 0 1586 AC-XTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWX-NH2 

1588 AC-YNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQLQKW-NH2 

1598 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 



20 



25 
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1600 AC-TLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQAR-NH2 

1603 AC - LQQKF YE I IMD IEQNNVQGKKGI QQLQKWED WVG W -NH2 

1627 Ac-YTSLIHSLISESQNQQEKNEQELLALDKWASLWNWF-NH2 

1628 Ac - YTSLIHSL IEESQNQQEKNEQELLEADKWASLWNWF -NH2 
162 9 Ac- YTSLIHSLIEESQNQQEKNEQELLELAKWASLWNWF-NH2 

1630 AC-YTSLIHSLIEESQNQQEKAEQELLELDKWASLWNWF-NH2 

1631 Ac - YTS L IHS L IEES QNQQEKNAQELLELDKWASLWNWF -NH2 

1632 AC-YTSLIHSLIEESQNQQEKNEAELLELDKWASLWNWF-NH2 

1634 Ac - WQEWEQKITALLEQAQIQQEKNEQELQKLDKWASLWEWF -NH2 

1635 Ac - WQEWEQKITALLEQAQIQQEKAEYELQKLDKWASLWEWF -NH2 

1636 Ac - WQEWEQKITALLEQAQIQQEKNAYELQKLDKWASLWEWF-NH2 

1637 Ac ~ WQEWEQKITALLEQAQIQQEKNEAELQKLDKWASLWEWF -NH2 

1644 Ac -EYDLRRWEK-NH2 

1645 Ac - EQELLELDK-NH2 
164 6 AC - E YELQKLDK-NH2 

164 7 Ac -WQEWEQKITALLEQAQIQQEKNEQELLKLDKWASLWEWF-NH2 

164 8 AC-WQEWEQKITALLEQAQIQQEKNEQELLELDKWASLWEWF-NH2 

1649 Ac - WQEWEQKITALLEQAQ I QQEKNDKWAS L WEWF - NH2 

1650 AC - YTSLIHSL IEESQNQAEKNEQELLELDKWASLWNWF -NH2 

1651 AC-YTSLIHSLIEESQNQQAKNEQELLELDKWASLWNWF-NH2 

1652 Ac - YTSL IHSLIEESQNQQEANEQELLELDKWASLWNWF -NH2 

1653 Ac - YTSLIHSLIEESANQQEANEQELLELDKWASLWNWF -NH2 

1654 AC-YTSLIHSLIEESQAQQEKNEQELLELPKWASLWNWF-NH2 

1655 AC-YTSLIHSLIEESQNAQEKNEQELLELDKWASLWNWF-NH2 

1656 Ac - YTSLIHALIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1657 Ac - YTS LIHSAIEES QNQQEKNEQELLELDKWASLWNWF - NH2 

1658 AC-VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

1659 Ac - YTSLIHSLAEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1660 Ac - YTSAIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1661 Ac - YTSLAHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1662 AC-YTSLIASLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1663 AC-ATSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1664 Ac - YASLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1665 Ac - YTALIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1666 Ac - RIQDLEKYVEDTKIDLWS YNAELLVALENQ -NH2 

1667 Ac - HTIDLTDSEMNKLFEKTRRQLREN -NH2 

1668 Ac - SEMNKLFEKTRRQLREN -NH2 

1669 Ac - VFPSDEADAS ISQVNEKINQSLAF IRKSDELLHNV-NH2 

1670 AC-VFPSDEFAASISQVNEKINQSIAFIRKSDELLHNV-NH2 

1671 Ac - VFPSDEFDASISAVNEKINQSLAFIRKSDELLHNV-NH2 

1672 AC - VFPSDEFDAS ISQANEKINQSLAFIRKSDELLHNV-NH2 

1673 AC -VFPSDEFDASISQVAEKINQSLAFIRKSDELLHNV-NH2 

1674 AC -WQEWEQKITAALEQAQIQQEK2IEYELQKLDKWASLWEWF-NH2 

1675 AC -WQEWEQKITALAEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1676 AC -WQEWEQKITALLEQAAIQQEKNEYELQKLDKWASLWEWF-NH2 

1677 Ac - WQEWEQKITALLEQAQAQQEKWEYELQKLDKWASLWEWF -NH2 

1678 Ac - WQEWEQKITALLEQAQIAQEKNEYELQKLDKWASLWEWF-NH2 

1679 Ac - WQEWEQKITALLEQAQIQAEKNEYELQKLDKWASLWEWF -NH2 

1680 Ac - VFPSDEFDASISQVNEKIWQSAAFIRKSDELLHNV-NH2 

1681 Ac - VFPSDEFDASISQVNEKINQSLAAIRKSDELLHNV-NH2 



- 81 - 



WO 01/51673 



PCT/US00/35727 



T 

No. 



Sequence 



16 82 Ac -VFPSDEFDAS ISQVNEKINQSLAFIRKSDEALHNV-NH2 

1683 AC-VFPSDEFDASISQVNEKINQSLAFIRKSDELAHNV-NH2 

1684 AC- VFPSDEFDAS ISQVNEKINQSLAFIRKSDELLANV-NH2 

1685 AC - WQEWEQKITALLEQAQIQQAKNEYELQKLDKWASLWEWF -NH2 

1687 Ac - WQEWEQKITALLEQAQ I QQEKNE YELQALDKWASLWE WF - NH2 

1688 AC - WQEWEQKITALLEQAQ I Q'QfeKNE YELQKADKWAS LWEWF -NH2 



10 
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5.4. SYNTHESIS OF PEPTIDES 
The peptides of the invention may be synthesized 
or prepared by techniques well known in the art. See, 
for example, Creighton, 1983, Proteins: Structures 
and Molecular Principles, W.H. Freeman and Co., NY, 
5 which is incorporated herein by reference in its 
entirety. Short peptides, for example, can be 
synthesized on a solid support or in solution. Longer 
peptides may be made using recombinant DNA techniques. 
Here, the nucleotide sequences encoding the peptides 
10 of the invention may be synthesized, and/or cloned, 
and expressed according to techniques well known to 
those of ordinary skill in the art. See, for example, 
Sambrook, et al. , 198 9, Molecular Cloning, A 
Laboratory Manual, Vols. 1-3, Cold Spring Harbor 
Press, NY. 

15 

The peptides of the invention may alternatively 
be synthesized such that one or more of the bonds 
which link the amino acid residues of the peptides are 
non-peptide bonds. These alternative non-peptide 
bonds may be formed by utilizing reactions well known 

20 to those in the art, and may include, but are not 

limited to imino, ester, hydrazide, semicarbazide , and 
azo bonds, to name but a few. In yet another 
embodiment of the invention, peptides comprising the 
sequences described above may be synthesized with 

25 additional chemical groups present at their amino 
and/or carboxy termini, such that, for example, the 
stability, bioavailability, and/or inhibitory activity 
of the peptides is enhanced. For example, hydrophobic 
groups such as carbobenzoxyl , dansyl, or t- 

^ butyloxycarbonyl groups, may be added to the peptides 1 
amino termini. Likewise, an acetyl group or a 9- 
f luorenylmethoxy-carbonyl group may be placed at the 
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peptides 1 amino termini. (See "X" Tables I to IV, 
above.) Additionally, the hydrophobic group, t- 
butyloxycarbonyl , or an amido group may be added to 
the peptides' carboxy termini. (See "Z" in Tables I 
to IV, above.) 

Further, the peptides of the invention may be 
synthesized such that their steric configuration is 
altered. For example, the D- isomer of one or more of 
the amino acid residues of the peptide may be used, 
rather than the usual L- isomer. 

Still further, at least one of the amino acid 
residues of the peptides of the invention may be 
substituted by one of the well known non-naturally 
occurring amino acid residues. Alterations such as 
these may serve to increase the stability, 
bioavailability and/or inhibitory action of the 
peptides of the invention. 

Any of the peptides described above may, 
additionally, have a macromolecular carrier group 
covalently attached to their amino and/or carboxy 
termini. Such macromolecular carrier groups may 
include, for example, lipid-fatty acid conjugates, 
polyethylene glycol, carbohydrates or additional 
peptides. "X", in Tables I to IV, above, may 
therefore additionally represent any of the above 
macromolecular carrier groups covalently attached to 
the amino terminus of a peptide, with an additional 
peptide group being preferred. Likewise, u Z n , in 
Tables I to IV, may additionally represent any of the 
macromolecular carrier groups described above. 

5.5. ASSAYS FOR ANTI -MEMBRANE FUSION ACTIVITY 
Described herein, are methods for ability of a 
compound, such as the peptides of the invention, to 
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inhibit membrane fusion events. Specifically, assays 
for cell fusion events are described in Section 5.5.1, 
below, and assays for antiviral activity are described 
in Section 5.5.2, below. 

5 5.5.1 ASSAYS FOR CELL FUSION EVENTS 

Assays for cell fusion events are well known to 
those of skill in the art, and may be used in 
conjunction, for example, with the peptides of the 
invention to test the peptides' antifusogenic 

10 capabilities. 

Cell fusion assays are generally performed in 
vitro. Such an assay may comprise culturing cells 
which, in the absence of any treatment would undergo 
an observable level of syncytial formation. For 
example, uninfected cells may be incubated in the 
presence of cells chronically infected with a virus 
that induces cell fusion. Such viruses may include, 
but are not limited to, HIV, SIV, or respiratory 
syncytial virus. 

For the assay, cells are incubated in the 

20 presence of a peptide to be assayed. For each 

peptide, a range of peptide concentrations may be 
tested. This range should include a control culture 
wherein no peptide has been added. 

Standard conditions for culturing cells, well 

25 known to those of ordinary skill in the art, are used. 
After incubation for an appropriate period (24 hours 
at 3 7°C, for example) the culture is examined 
microscopically for the presence of multinucleated 
giant cells, which are indicative of cell fusion and 
syncytial formation. Well known stains, such as 

30 

crystal violet stain, may be used to facilitate the 
visualization of syncytial formation. 
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5.5.2 ASSAYS FOR ANTIVIRAL ACTIVITY 
The antiviral activity exhibited by the peptides 
of the invention may be measured, for example, by 
easily performed in vitro assays, such as those 
described below, which can test the peptides ■ ability 
5 to inhibit syncytia formation, or their ability to 
inhibit infection by cell-free virus. Using these 
assays, such parameters as the relative antiviral 
activity of the peptides, exhibit against a given 
strain of virus and/or the strain specific inhibitory 

10 activity of the peptide can be determined. 

A cell fusion assay may be utilized to test the 
peptides 1 ability to inhibit viral -induced, such as 
HIV- induced, syncytia formation in vitro . Such an 
assay may comprise culturing uninfected cells in the 
presence of cells chronically infected with a 
syncytial -inducing virus and a peptide to be assayed. 
For each peptide, a range of peptide concentrations 
may be. tested. This range should include a control 
culture wherein no peptide has been added. Standard 
conditions for culturing, well known to those of 

20 ordinary skill in the art, are used. After incubation 
for an appropriate period (24 hours at 37°C, for 
example) the culture is examined microscopically for 
the presence of multinucleated giant cells, which are 
indicative of cell fusion and syncytia formation. 

25 Well known stains, such as crystal violet stain, may 
be used to facilitate syncytial visualization. Taking 
HIV as an example, such an assay would comprise CD-4+ 
cells (such as Molt or CEM cells, for example) 
cultured in the presence of chronically HIV-infected 

3q cells and a peptide to be assayed. 

Other well known characteristics of viral 
infection may also be assayed to test a peptide's 
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antiviral capabilities. Once again taking HIV as an 
example, a reverse transcriptase (RT) assay may be 
utilized to test the peptides 1 ability to inhibit 
infection of CD-4* cells by cell-free HIV. Such an 
assay may comprise culturing an appropriate 
5 concentration ( i.e. , TCID S0 ) of virus and CD-4* cells 
in the presence of the peptide to be tested. Culture 
conditions well known to those in the art are used. 
As above, a range of peptide concentrations may be 
used, in addition to a control culture wherein no 

10 peptide has been added. After incubation for an 
appropriate period ( e.g. , 7 days) of culturing, a 
cell-free supernatant is prepared, using standard 
procedures, and tested for the present of RT activity 
as a measure of successful infection. The RT activity 

^ may be tested using standard techniques such as those 
described by, for example, Goff et al . (Goff , S rf et 
al., 1981, J. Virol. 38:239-248) and/or Willey et_al. 
(Willey, R. et al . , 1988, J. Virol. 62:139-147). 
These references are incorporated herein by reference 
in their entirety. 

20 Standard methods which are well-known to those of 

skill in the art may be utilized for assaying non- 
retroviral activity. See, for example, Pringle et al. 
(Pringle, C.R. et al . , 1985, J. Medical Virology 
3/7:377-386) for a discussion of respiratory syncytial 

25 virus and parainfluenza virus activity assay 

techniques. Further, see, for example, "Zinsser 
Microbiology", 1988, Joklik, W.K. et al., eds., 
Appleton Ec Lange, Norwalk, CT, 19th ed., for a general 
review of such techniques . These references are 

^ incorporated by reference herein in their entirety. 

In addition, the Examples presented below, in Sections 
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17, 18, 26 and 27 each provide additional assays for 
the testing of a compounds antiviral capability. 

In vivo assays may also be utilized to test, for 
example, the antiviral activity of the peptides of the 
invention. To test for anti-HIV activity, for 
5 example, the in vivo model described in Barnett et al. 
(Barnett, S.W. et al., 1994, Science 266 : 642-646) may 
be used. 

Additionally, ant i -RSV activity can be assayed in 
vivo via well known mouse models. For example, RSV 

10 can be administered intranasally to mice of various 
inbred strains. Virus replicates in lungs of all 
strains, but the highest titers are obtained in P/N, 
C57L/N and DBA/2N mice. Infection of BALB/c mice 
produces an asymptomatic bronchiolitis characterized 
by lymphocytic infiltrates and pulmonary virus titers 
of 10 4 to 10 s pfu/g of lung tissue (Taylor, G. et al., 
1984, Infect. Immun. 43.: 649-655) . 

Cotton rat models of RSV are also well known. 
Virus replicates to high titer in the nose and lungs 
of the cotton rat but produces few if any signs of 

20 inflammation. 



5.6. USES OF THE PEPTIDES OF THE INVENTION 
The peptides of the invention may be utilized as 
ant if usogenic or antiviral compounds, or as compounds 

25 which modulate intracellular processes involving 
coiled coil peptide structures. Further, such 
peptides may be used to identify agents which exhibit 
antif usogenic, antiviral or intracellular modulatory 
activity. Still further, the peptides of the 

30 invention ma Y be utilized as organism or viral 
type/subtype-specific diagnostic tools. 
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The antifusogenic capability of the peptides of 
the invention may additionally be utilized to inhibit 
or treat/ameliorate symptoms caused by processes 
involving membrane fusion events. Such events may 
include, for example, virus transmission via cell-cell 
5 fusion, abnormal neurotransmitter exchange via cell- 
fusion, and sperm-egg fusion. Further, the peptides 
of the invention may be used to inhibit free viral, 
such as retroviral, particularly HIV, transmission to 
uninfected cells wherein such viral infection involves 
10 membrane fusion events or involves fusion of a viral 
structure with a cell membrane. Among the 
intracellular disorders involving coiled coil peptides 
structures which may be ameliorated by the peptides of 
the invention are disorders involving, for example, 
bacterial toxins. 

15 

With respect to antiviral activity, the viruses 
whose transmission may be inhibited by the peptides of 
the invention include, but are not limited to human 
retroviruses, such as HIV-1 and HIV- 2 and the human T- 
lymphocyte viruses (HTLV-I and II) , and non-human 

20 retroviruses such as bovine leukosis virus, feline 

sarcoma and leukemia viruses, simian immunodeficiency, 
sarcoma and leukemia viruses, and sheep progress 
pneumonia viruses . 

Non retroviral viruses whose transmission may be 

25 inhibited by the peptides of the invention include, 
but are not limited to human respiratory syncytial 
virus, canine distemper virus, newcastle disease 
virus, human parainfluenza virus, influenza viruses, 
measles viruses, Epstein-Barr viruses, hepatitis B 

^ viruses, and simian Mason-Pfizer viruses. 

Non enveloped viruses whose transmission may be 
inhibited by the peptides of the invention include, 
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but are not limited to picornaviruses such as polio 
viruses, hepatitis A virus, enterovirus, echoviruses 
and coxsackie viruses, papovaviruses such as papilloma 
virus, parvoviruses, adenoviruses and reoviruses. 

As discussed more fully, below, in Section 5.6.1 
5 and in the Example presented, below, in Section 8, 
DP107, DP178, DP107 analog and DP178 analog peptides 
form non-covalent protein-protein interactions which 
are required for normal activity of the virus. Thus, 
the peptides of the invention may also be utilized as 
10 components in assays for the identification of 

compounds that interfere with such protein-protein 
interactions and may, therefore, act as antiviral 
agents. These assays are discussed, below, in Section 
5.6.1. 

As demonstrated in the Example presented below in 
Section 6, the antiviral activity of the peptides of 
the invention may show a pronounced type and subtype 
specificity, i.e. . specific peptides may be effective 
in inhibiting the activity of only specific viruses. 
This feature of the invention presents many 

20 advantages. One such advantage, for example, lies in 
the field of diagnostics, wherein one can use the 
antiviral specificity of the peptide of the invention 
to ascertain the identity of a viral isolate. With 
respect to HIV, one may easily determine whether a 

25 viral isolate consists of an HIV-1 or HIV- 2 strain. 

For example, uninfected CD-4* cells may be co-infected 
with an isolate which has been identified as 
containing HIV the DP178 (SEQ ID:1) peptide, after 
which the retroviral activity of cell supernatants may 

3Q be assayed, using, for example, the techniques 

described above in Section 5.2. Those isolates whose 
retroviral activity is completely or nearly completely 
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inhibited contain HIV-1. Those isolates whose viral 
activity is unchanged or only reduced by a small 
amount, may be considered to not contain HIV-1. Such 
an isolate may then be treated with one or more of the 
other DP178 peptides of the invention, and 
5 subsequently be tested for its viral activity in order 
to determine the identify of the viral isolate. The 
DP107 and DP178 analogs of the invention may also be 
utilized in a diagnostic capacity specific to the type 
and subtype of virus or organism in which the specific 
10 peptide sequence is found. A diagnostic procedure as 
described, above, for DP178, may be used in 
conjunction with the DP107/DP178 analog of interest. 

5.6.1. SCREENING ASSAYS 
^ As demonstrated in the Example presented in 

Section 8, below, DP107 and DP178 portions of the TM 
protein gp4l, i.e., the HRl and HR2 portions of gp4l, 
respectively, form non-covalent protein-protein 
interactions. As is also demonstrated, the 
maintenance of such interactions is necessary for 
20 normal viral infectivity. Thus, compounds which bind 
DP107? bind DP17 8, and/or act to disrupt normal 
DP107/DP178 protein-protein interactions may act as 
antifusogenic, antiviral or cellular modulatory 
agents. Described below are assays for the 
25 identification of such compounds. Note that, while, 
for ease and clarity of discussion, DP107 and DP178 
peptides will be used as components of the assays 
described, but it is to be understood that any of the 
DP107 analog or DP178 analog peptides described, 
above, in Sections 5.1 through 5.3 may also be 
utilized as part of these screens for compounds. 
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For example, in certain embodiments the assays of 
the invention may be use DP107 and/or DP178 analogs 
that contain one or more amino acid residue 
truncations, deletions, insertions or substitutions. 
In particular, in one preferred embodiment, the DP107, ' 
5 DP178, DP107-like and DP178-like peptides can comprise 
amino and/or carboxy- terminal insertions corresponding 
to about two to about fifty amino acids amino -to or 
carboxy- to the endogenous sequence from which the 
DP107, DP178, DP107-like or DP178-like peptide is 

10 derived. In another particular embodiment, the 

peptides used in the assays described herein further 
comprise additional, heterologous sequence useful for 
detecting, immobilizing and/or purifying the 
particular peptide. Such heterologous sequences 
include, but are not limited to maltose binding fusion 
proteins containing a DP178, DP107, DP178-like Qr 
DPl07-like sequence such as the M41A178 and MF5.1 
maltose binding fusion proteins described in Sections 
8 and 30, below. 

In certain embodiments, such analogs will have 

20 reduced binding affinities and are therefore useful, 
e.g., to screen for compounds which inhibit the 
formation of or, alternatively, disrupt complexes 
between DP107/DP178 complexes. Among such reduced 
binding analogs are peptides exhibiting one or more 

25 alanine insertion or substitutions, including, e.g., 
the peptides described in the examples presented in 
Sections 3 0 and 31, below. It is understood that such 
analogs which have reduced binding affinities, 
including the analogs described in Sections 3 0 and 31 
below, are also part of the present invention. 
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Compounds which may be tested for an ability to 
bind DP107, DP178, and/or disrupt DP107/DP178 
interactions, and which therefore, potentially 
represent antifusogenic, antiviral or intracellular 
modulatory compounds, include, but are not limited to, 1 
5 peptides made of D- and/or L- configuration amino acids 
(in, for example, the form of random peptide 
libraries; see Lam, K.S. et al . , 1991, Nature 354 : 82- 
84), phosphopep tides (in, for example, the form of 
random or partially degenerate, directed 
10 phosphopeptide libraries; see, for example, Songyang, 
Z. et al . , 1993, Cell 72:767-778), antibodies, and 
small organic or inorganic molecules. Synthetic 
compounds, natural products, and other sources of 
potentially effective materials may be screened in a 
variety of ways, as described in this Section. 

Compounds that can be screened, tested and ^ 
identified as modulating HR1/HR2, DP178/DP107 and/or 
DP178-like/DP107-like interactions utilizing the 
methods described herein can, in general, include, 
e.g., small molecules that are of a molecular weight 
up to about 1500 daltons. Test compounds, including 
small molecules, can include, but are not limited to, 
compounds obtained from any commercial source, 
including Aldrich (1001 West St. Paul Ave., Milwaukee, 
WI 53233), Sigma Chemical (P.O. Box 14508, St. Louis, 
MO 63178), Fluka Chemie AG (Industriestrasse 25, CH- 
9471 Buchs, Switzerland (Fluka Chemical Corp. 980 
South 2nd Street, Ronkonkoma, NY 11779)), Eastman 
Chemical Company, Fine Chemicals (P.O Box 431, 
Kingsport, TN 37662), Boehringer Mannheim GmbH 
(Sandhofer Strasse 116, D-68298 Mannheim), Takasago (4 
Volvo Drive, Rockleigh, NJ 07647), SST Corporation 
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(635 Brighton Road, Clifton, NJ 07012), Ferro (111 
West Irene Road, Zachary, LA 70791) , Riedel-deHaen 
Aktiengesellschaf t (P.O. Box D-30918, Seelze, 
Germany), PPG Industries Inc., Fine Chemicals (One PPG 
Place, 34th Floor, Pittsburgh, PA 15272) . Further any 
kind of natural products may be screened using the 
methods of the invention, including microbial, fungal 
or plant extracts. 

Furthermore, diversity libraries of test 
compounds, including small molecule test compounds, 
may be commercially obtained from Specs and BioSpecs 
B.V. (Rijswijk, The Netherlands) , Chembridge 
Corporation (San Diego, CA) , Contract Service Company 
(Dolgoprudny, Moscow Region, Russia) , Comgenex USA 
Inc. (Princeton, NJ) , Maybridge Chemicals Ltd. 
(Cornwall PL34 OHW, United Kingdom) , and Asinex 
(Moscow, Russia) . Combinatorial libraries of test 
compounds, including small molecule test compounds, 
can be may be generated as disclosed in Eichler & 
Houghten, 1995, Mol. Med. Today 1: 174-180 ; Dolle, 
1997, Mol. Divers. 2:223-236; Lam, 1997, Anticancer 
Drug Des. 12.: 145-167. These references are 
incorporated hereby by reference in their entirety. 
It is to be noted that such references also teach 
additional screening methods which may be employed for 
the further testing of compounds identified via the 
methods of the invention and which can aid in 
identifying and isolating compounds which can 
represent leads and therapeutic compounds. 
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The compounds, antibodies, or other molecules 
identified may be tested, for example, for an ability 
to inhibit cell fusion or viral activity, utilizing, 
for example, assays such as those described, above, in 
Section 5.5. 

5 Among the peptides which may be tested are 

soluble peptides comprising DP107 and/or DP178 
domains, and peptides comprising DP107 and/or DP178 
domains having one or more mutations within one or 
both of the domains, such as the M41-P peptide 
10 described, below, in the Example presented in Section 
8, which contains a isoleucine to proline mutation 
within the DP178 sequence. 

In one embodiment of such screening methods is a 
method for identifying a compound to be tested for 
antiviral ability comprising: 

1 exposing at least one compound ta a 
peptide comprising a DP107 peptide for a time 
sufficient to allow binding of the compound to the 
DP107 peptide; 

2 removing non-bound compounds; and 
20 3 determining the presence of the 

compound bound to the DP107 peptide, 

thereby identifying an agent to be tested for 

antiviral ability. 

In a second embodiment of such screening methods 
25 is a method for identifying a compound to be tested 
for antiviral ability comprising: 

(a) exposing at least one compound to a 
peptide comprising a DP178 peptide for a time 
sufficient to allow binding of the compound to the 
DP178 peptide; 

(b) removing non-bound compounds; and 
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(c) determining the presence of the 
compound bound to the DP178 peptide, 
thereby identifying an agent to be tested for 
antiviral ability. 

One method utilizing these types of approaches 
5 that may be pursued in the isolation of such DP107- 
binding or DP178 -binding compounds is an assay which 
would include the attachment of either the DP107 or 
the DP178 peptide to a solid matrix*, such as, for 
example, agarose or plastic beads, microtiter plate 

10 wells, petri dishes, or membranes composed of, for 
example, nylon or nitrocellulose. In such an assay 
system, either the DP107 or DP178 protein may be 
anchored onto a solid surface, and the compound, or 
test substance, which is not anchored, is labeled, 
either directly or indirectly {e.g., with a 
radioactive label such as 125 I, an absorption lab^el 
such as biotin, or a fluorescent label such as 
fluorescein or rhodamine) . In practice, microtiter 
plates are conveniently utilized. The anchored 
component may be immobilized by non-covalent or 

20 covalent attachments. Non-covalent attachment may be 
accomplished simply by coating the solid surface with 
a solution of the protein and drying. Alternatively, 
an immobilized antibody, preferably a monoclonal 
antibody, specific for the protein may be used to 

25 anchor the protein to the solid surface. The surfaces 
may be prepared in advance and stored. 

In order to conduct the assay, the labeled 
compound is added to the coated surface containing the 
anchored DP107 or DP178 peptide. After the reaction 

30 com P lete ' unreacted components are removed ( e.g. , 

by washing) under conditions such that any complexes 
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formed will remain immobilized on the solid surface. 
The detection of complexes anchored on the solid 
surface can be accomplished in a number of ways. 
Where the compound is pre -labeled, the detection of 
label immobilized on the surface indicates that 
5 complexes were formed. Where the labeled component is 
not pre -labeled, an indirect label can be used to 
detect complexes anchored on the surface; e.g. , using 
a labeled antibody specific for the compound (the 
antibody, in turn, may be directly labeled or 

10 indirectly labeled with a labeled anti-Ig antibody) . 

Alternatively, such an assay can be conducted in 
a liquid phase, the reaction products separated from 
unreacted components, and complexes detected; e.g. , 
using an immobilized antibody specific for DP107 or 

^ DP178, whichever is appropriate for the given assay, 
or ab antibody specific for the compound, i.e. , the 
test substance, in order to anchor any complexes 
formed in solution, and a labeled antibody specific 
for the other member of the complex to detect anchored 
complexes . 

20 By utilizing procedures such as this, large 

numbers of types of molecules may be simultaneously 
screened for DP107 or DP178 -binding capability, and 
thus potential antiviral activity. 

Further, compounds may be screened for an ability 

25 to inhibit the formation of or, alternatively, disrupt 
DP107/DP178 complexes. Such compounds may then be 
tested for antifusogenic, antiviral or intercellular 
modulatory capability. For ease of description, DP107 
and DP178 will be referred to as "binding partners." 

^ Compounds that disrupt such interactions may exhibit 
antiviral activity. Such compounds may include, but 
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are not limited to molecules such as antibodies, 
peptides, and the like described above. 

The basic principle of the assay systems used to 
identify compounds that interfere with the interaction 
between the DP107 and DP178 peptides involves 
5 preparing a reaction mixture containing peptides under 
conditions and for a time sufficient to allow the two 
peptides to interact and bind, thus forming a complex. 
In order to test a compound for disruptive activity, 
the reaction is conducted in the presence and absence 
10 of the test compound, i.e. , the test compound may be 
initially included in the reaction mixture, or added 
at a time subsequent to the addition of one of the 
binding partners; controls are incubated without the 
test compound or with a placebo. The formation of any 
complexes between the binding partners is then 
detected. The formation of a complex in the control 
reaction, but not in the reaction mixture containing 
the test compound indicates that the compound 
interferes with the interaction of the DP107 and DP178 
peptides . 

20 The assay for compounds that interfere with the 

interaction of the binding partners can be conducted 
in a heterogeneous or homogeneous format . 
Heterogeneous assays involve anchoring one of the 
binding partners onto a solid phase and detecting 

25 complexes anchored on the solid phase at the end of 
the reaction. In homogeneous assays, the entire 
reaction is carried out in a liquid phase. In either 
approach, the order of addition of reactants can be 
varied to obtain different information about the 
compounds being tested. For example, test compounds 
that interfere with the interaction between the 
binding partners, e.g. r by competition, can be 
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identified by conducting the reaction in the presence 
of the test substance; i.e. , by adding the test 
substance to the reaction mixture prior to or 
simultaneously with the binding partners. On the 
other hand, test compounds that disrupt preformed 
5 complexes, e.g. compounds with higher binding 

constants that displace one of the binding partners 
from the complex, can be tested by adding the test 
compound to the reaction mixture after complexes have 
been formed. The various formats are described 

10 briefly below. 

In a heterogeneous assay system, one binding 
partner, e.g. , either the DP107 or DP178 peptide, is 
anchored onto a solid surface, and its binding 
partner, which is not anchored, is labeled, either 

^ directly or indirectly (e.g., with a radioactive label 
such as 125 I, an absorption label such as biotin, or a 
fluorescent label such as fluorescein or rhodamilae) . 
In practice, microtiter plates are conveniently 
utilized. The anchored species may be immobilized by 
non-covalent or covalent attachments. Non-covalent 

20 attachment may be accomplished simply by coating the 
solid surface with a solution of the protein and 
drying. Alternatively, an immobilized antibody 
specific for the protein may be used to anchor the 
protein to the solid surface. The surfaces may be 

25 prepared in advance and stored. 

In order to conduct the assay, the binding 
partner of the immobilized species is added to the 
coated surface with or without the test compound. 
After the reaction is complete, unreacted components 
are removed ( e.g. , by washing) and any complexes 

30 

formed will remain immobilized on the solid surface. 
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The detection of complexes anchored on the solid 
surface can be accomplished in a number of ways. 
Where the binding partner was pre -labeled, the 
detection of label immobilized on the surface 
indicates that complexes were formed. Where the 
5 binding partner is not pre-labeled, an indirect label 
can be used to detect complexes anchored on the 
surface; e.g. , using a labeled antibody specific for 
the binding partner (the antibody, in turn, may be 
directly labeled or indirectly labeled with a labeled 

10 anti-Ig antibody) . Depending upon the order of 

addition of reaction components, test compounds which 
inhibit complex formation or which disrupt preformed 
complexes can be detected. 

Alternatively, the reaction can be conducted in a 

^ liquid phase in the presence or absence of the test 
compound, the reaction products separated from 
unreacted components, and complexes detected; e.g. , 
using an immobilized antibody specific for one binding 
partner to anchor any complexes formed in solution, 
and a labeled antibody specific for the other binding 

20 partner to detect anchored complexes. Again, 

depending upon the order of addition of react ants to 
the liquid phase, test compounds which inhibit complex 
or which disrupt preformed complexes can be 
identified. 

25 In an alternate embodiment of the invention, a 

homogeneous assay can be used. In this approach, a 
preformed complex of the DP107 and DP178 peptides is 
prepared in which one of the binding partners is 
labeled, but the signal generated by the label is 

3q quenched due to complex formation (see, e.g. , U.S. 

Patent No. 4,109,496 by Rubenstein which. utilizes this 
approach for immunoassays) . The addition of a test 
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substance that competes with and displaces one of the 
binding partners from the preformed complex will 
result in the generation of a signal above background. 
In this way, test substances which disrupt DP- 107/ 
DP-178 protein-protein interaction can be identified. 
5 In still another embodiment of the invention, 

fluorescence polarization may be used in a homogenous 
assay. In this approach, complex formation is 
detected by measuring the polarization of a 
f luorescently labeled peptide (e.g., with fluorescein 

10 or rhodamine) in a sample. Binding of the peptide to 
its complementary HR1 or HR2 binding domain in a 
larger molecular weight peptide or protein, such as in 
a maltose binding fusion protein described herein, 
alters the correlation time of the fluorescent moiety 

^ and thereby decreases the fluroescence polarization of 
the labeled peptide. 

In an alternative screening assay, test compounds 
may be assayed for the their ability to disrupt a 
DP178/DP107 interaction, as measured immunometrically 
using an antibody specifically reactive to a 

20 DP107/DP178 complex ( i.e. , an antibody that recognizes 
neither DP107 nor DP178 individually) . Such an assay 
acts as a competition assay, and is based on 
techniques well known to those of skill in the art. 

The above competition assay may be described, by 

25 way of example, and not by way of limitation, by using 
the DP178 and M41A178 peptides and by assaying test 
compounds for the disruption of the complexes formed 
by these two peptides by immunometrically visualizing 
DP178/M41A178 complexes via the human recombinant Fab, 

3q Fab-d, as described, below, in the Example presented 
in Section 8. M41A178 is a maltose binding fusion 
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protein containing a gp4l region having its DP178 
domain deleted, and is described, below, in the 
Example presented in Section 8. 

Utilizing such an assay, M41A178 may be 
immobilized onto solid supports such as microtiter 
5 wells. A series of dilutions of a test compound may 
then be added to each M41A178 -containing well in the 
presence of a constant concentration of DP- 178 
peptide. After incubation, at, for example, room 
temperature for one hour, unbound DP- 178 and test 

10 compound are removed from the wells and wells are then 
incubated with the DP178/M41A178 -specif ic Fab-d 
antibody. After incubation and washing, unbound Fab-d 
is removed from the plates and bound Fab-d is 
quantitated. A no-inhibitor control should also be 
conducted. Test compounds showing an ability to 
disrupt DP178/M41A178 complex formation are identified 
by their concentration-dependent decrease in the level 
of Fab-d binding. 

A variation of such an assay may be utilized to 
perform a rapid, high- throughput binding assay which 

20 is capable of directly measuring DP178 binding to 

M41A178 for the determination of binding constants of 
the ligand of inhibitory constants for competitors of 
DP178 binding. 

Such an assay takes advantage of accepted 

25 radioligand and receptor binding principles. (See, 
for example, Yamamura, H.I. et al., 1985, 
"Neurotransmitter Receptor Binding" , 2nd ed., Raven 
Press, NY.) As above, M41A178 is immobilized onto a 
solid support such as a microtiter well. DP178 

3Q binding to M41A178 is then quantitated by measuring 
the fraction of DP178 that is bound as 12S I-DP178 and 
calculating the total amount bound using a value for 

- 102 - 



WO 01/51673 



PCT/US00/35727 



specific activity (dpm/^g peptide) determined for each 
labeled DP178 preparation. Specific binding to 
M41A178 is defined as the difference of the binding of 
the labeled DP178 preparation in the microtiter wells 
(totals) and the binding in identical wells 
5 containing, in addition, excess unlabeled DP17 8 
(nonspecif ics) . 

Because the binding affinity for native DP178 and 
DP107 is very high (including native DP178-like and 
DPl07-like peptides from other species; e.g., 10 nM 
10 for DP178 in HIV-l, and 2 nM for T112 in RSV) , test 
compounds must exhibit high binding properties to 
interfere with or disrupt the DP178/DP107 binding 
interaction. Accordingly, in another non-limiting 
example of the above-described competitions assays, 
such assays can be performed using "modified" DP107 

15 

and/or DP178 peptides (e.g., DP107 and/or DP178 
analogs) which have reduced binding affinities 
relatived to the unmodified "parent peptides". The 
use of such modified DP107 and DP178 peptides greatly 
increases the sensitivity of the competition assays of 

20 the invention by identifying more compounds with 
inhibitory potential. The binding affinities of 
compounds identified in the assays can then be 
optimized, e.g., using standard medicinal chemistry 
techniques, to generate compounds that are more 

25 powerful inhibitors of DP107/DP178 complex formation 
and are therefore useful, e.g., as antiviral reagents. 
Alternatively, compounds identified in the competition 
assays using DP107 and/or DP17 8 analogs with reduced 
binding affinities may, themselves, be useful, e.g., 
as antiviral reagents. 
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The term "reduced affinity, " as used herein, 
refers to a DP107, DP178, DP107-like or DP178-like 
peptide that interacts with and forms a DP107/DP178 
peptide pair, a HR1/DP178 pair or an HR2/DP107 pair 
under competition assay conditions, but interacts with, 
5 its "partner" to form such a pair with a lower 

affinity than would a DP107 or DP178 "parent" peptide 
from which the reduced affinity peptide is derived. 

Generally, the binding affinity of a peptide can 

be expressed as a B 50 value, i.e., the concentration of 

xo peptide necessary for 50% of the peptide molecules to 

bind to their target under a given set of conditions. 

Preferably, the B so value of a reduced affinity peptide 

will by at least twice, and more preferably at least 

five times, at least 10 times, at least 20 times, or 

at least 100 times the B so value of the unmodified 
15 50 

peptide from which it was derived. 

Modified DP107 and DP178 peptides that have 
reduced binding affinities may be generated according 
to any number of techniques that will be readily 
apparent to those skilled in the art. For example, in 

20 one embodiment modified DP107 and DP178 peptides with 
reduced binding affinities may be generated by 
generating truncated DP107 and DP178 peptides, 
respectively. Such peptides may be routinely 
synthesized and tested, e.gr., by the above described 

25 screening assays, to determine their binding 

affinities to their target. For example, as described 
in the example presented below in Section 30, reducing 
the length of the native RSV DP178-like peptide T112 
from 35 to 2 8 amino acid residues resulted in a five 
fold drop in binding affinity (from 1 nM to 5 nM) . 
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Generally, such truncation can be of 1, 2,3,4, 5, 6, 
7, 8, 9 or 10 amino acid residues. 

Alternatively, modified DP107 and DP178 peptides 
with reduced binding affinity may be identified and 
generated by identifying and substituting, inserting 
5 or deleting amino acid residues. For example in one 
embodiment, which is also demonstrated in the example 
presented below in Section 30, modified DP107 and/or 
DP178 peptides may be routinely synthesized and 
assayed for reduced binding affinity by systematically 

10 replacing one or more amino acid residues of the 

native DP107 or DP178 peptide with other amino acid 
residues and testing the binding affinity of the 
resulting peptide by techniques such as those 
described herein. Preferably, the substituted amino 
acid residues are neutral amino acid residues 
exhibiting relatively small side chains, such as, 
alanine or glycine. 

Such substitutions can identify "key" amino acid 
residues and can be used in the competition assays of 
the invention. Alternatively, upon identification of 

20 key residues by such systematic substitutions, the key 
residues can be changed to other residues and the 
resulting, modified peptides can be tested for binding 
affinity. 

Modified DP107 and/or DP178 peptides that have 
25 reduced binding affinities may still further be 

identified using principles of protein chemistry and 
design that are well known to those of skill in the 
art. Specifically, such principles may be used to 
identify those amino acid residues of a native DP107 
3Q or DP178 sequence that effect, e.gr., solubility, 

binding affinity, or stability of the peptide. Thus, 
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for example, using known principles of amino acid 
chemistry and protein design one skilled in the art 
could identify amino acid residues in a native DP107 
or DP178 peptide that affect the structure of the 
peptide. 

5 

5.7 PHARMACEUTICAL FORMULATIONS, DOSAGES 
AND MODES OF ADMINISTRATION 

The peptides of the invention may be administered 

using techniques well known to those in the art. 

Preferably, agents are formulated and administered 

10 

systemically. Techniques for formulation and 
administration may be found in "Remington's 
Pharmaceutical Sciences", 18th ed., 1990, Mack 
Publishing Co., Easton, PA. Suitable routes may 
include oral, rectal, transmucosal, or intestinal 

15 administration; parenteral delivery, including 
intramuscular, subcutaneous, intramedullary 
injections, as well as, intrathecal, direct 
intraventricular,' intravenous, intraperitoneal, 
intranasal, or intraocular injections, just to name a 

20 few. For injection, the agents of the invention may 
be formulated in aqueous solutions, preferably in 
physiologically compatible buffers such as Hanks' 
solution, Ringer's solution, or physiological saline 
buffer. For such transmucosal administration, 
penetrants appropriate to the barrier to be permeated 
are used in the formulation. Such penetrants are 
generally known in the art. 

In instances wherein intracellular administration 
of the peptides of the invention or other inhibitory 
agents is preferred, techniques well known to those of 

30 ordinary skill in the art may be utilized. For 
example, such agents may be encapsulated into 
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liposomes, then administered as described above. 
Liposomes are spherical lipid bilayers with aqueous 
interiors . All molecules present in an aqueous 
solution at the time of liposome formation are 
incorporated into the aqueous interior. The liposomal 
5 contents are both protected from the external 

microenvironment and, because liposomes fuse with cell 
membranes, are effectively delivered into the cell 
cytoplasm. Additionally, due to their hydrophobicity, 
when small molecules are to be administered, direct 

10 intracellular administration may be achieved. 

Nucleotide sequences encoding the peptides of the 
invention which are to be intracellularly administered 
may be expressed in cells of interest, using 
techniques well known to those of skill in the art. 

^ For example, expression vectors derived from viruses 
such as retroviruses, vaccinia viruses, adeno- 
associated viruses, herpes viruses, or bovine 
papilloma viruses, may be used for delivery and 
expression of such nucleotide sequences into the 
targeted cell population. Methods for the 

20 construction of such vectors and expression constructs 
are well known. See, for example, Sambrook et al., ■ 
1989, Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Press, Cold Spring Harbor NY, and 
Ausubel et al., 1989, Current Protocols in Molecular 

25 Biology, Greene Publishing Associates and Wiley 
Interscience, NY. 

With respect to HIV, peptides of the invention, 
particularly DP107 and DP178, may be used as 
therapeutics in the treatment of AIDS. In addition, 
the peptides may be used as prophylactic measures in 
previously uninfected individuals after acute exposure 
to an HIV virus. Examples of such prophylactic use of 
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the peptides may include, but are not limited to, 
prevention of virus transmission from mother to infant 
and other settings where the likelihood of HIV 
transmission exists, such as, for example, accidents 
in health care settings wherein workers are exposed to 
5 HIV- containing blood products. The successful use of 
such treatments do not rely upon the generation of a 
host immune response directed against such peptides. 

Effective dosages of the peptides of the 
invention to be administered may be determined through 
!0 procedures well known to those in the art which 
address such parameters as biological half -life, 
bioavailability, and toxicity. Given the data 
presented below in Section 6, DP178, for example, may 
prove efficacious in vivo at doses required to achieve 
circulating levels of about 1 to about 10 ng per ml of 

15 

peptide. 

A therapeutically effective dose refers to that 
amount of the compound sufficient to result in 
. amelioration of symptoms or a prolongation of survival 
in a patient. Toxicity and therapeutic efficacy of 

20 such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or 
experimental animals, e.g. , for determining the LD S0 
(the dose lethal to 50% of the population) and the 
ED50 (the dose therapeutically effective in 50% of the 

25 population) . The dose ratio between toxic and 

therapeutic effects is the therapeutic index and it 
can be expressed as the ratio LD 50 /ED S0 . Compounds 
which exhibit large therapeutic indices are preferred. 
The data obtained from these cell culture assays and 
animal studies can be used in formulating a range of 

30 

dosage for use in humans. The dosage of such 
compounds lies preferably within a range of 
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circulating concentrations that include the ED50 with 
little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and 
the route of administration utilized. For any 
compound used in the method of the invention, the 
5 therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be 
formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC S0 
( e.g. , the concentration of the test compound which 
!0 achieves a half -maximal inhibition of the fusogenic 
event, such as a half -maximal inhibition of viral 
infection relative to the amount of the event in the 
absence of the test compound) as determined in cell 
culture. Such information can be used to more 
accurately determine useful doses in humans. Levels 

15 

in plasma may be measured, for example, by high - 
performance liquid chromatography (HPLC) . 

The peptides of the invention may, further, serve 
the role of a prophylactic vaccine, wherein the host 
raises antibodies against the peptides of the 

20 invention, which then serve to neutralize HIV viruses 
by, for example, inhibiting further HIV infection. 

Administration of the peptides of the invention 
as a prophylactic vaccine, therefore, would comprise 
administering to a host a concentration of peptides 

25 effective in raising an immune response which is 
sufficient to neutralize HIV, by, for example, 
inhibiting HIV ability to infect cells. The exact 
concentration will depend upon the specific peptide to 
be administered, but may be determined by using 

^ standard techniques for assaying the development of an 
immune response which are well known to those of 
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15 



ordinary skill in the art. The peptides to be used as 
vaccines are usually administered intramuscularly. 

The peptides may be formulated with a suitable 
adjuvant in order to enhance the immunological 
response. Such adjuvants may include, but are not 
5 limited to mineral gels such as aluminum hydroxide; 
surface active substances such as lysolecithin, 
pluronic polyols, polyanions; other peptides; oil 
emulsions; and potentially useful human adjuvants such 
as BCG and Corynebacterium parvum. Many methods may 
10 be used to introduce the vaccine formulations 

described here. These methods include but are not 
limited to oral, intradermal, intramuscular, 
intraperitoneal, intravenous, subcutaneous, and 
intranasal routes. 

Alternatively, an effective concentration of 
polyclonal or monoclonal antibodies raised against the 
peptides of the invention may be administered to a 
host so that no uninfected cells become infected by 
HIV. The exact concentration of such antibodies will 
vary according to each specific antibody preparation, 
but may be determined using standard techniques well 
known to those of ordinary skill in the art. 
Administration of the antibodies may be accomplished 
using a variety of techniques, including, but not 
limited to those described in this section. 
25 For all such treatments described above, the 

exact formulation, route of administration and dosage 
can be chosen by the individual physician in view of 
the patient's condition. (See e.g. Fingl et al., 
1975, in "The Pharmacological Basis of Therapeutics", 
Ch. 1 pi) . 

It should be noted that the attending physician 
would know how to and when to terminate, interrupt, or 
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adjust administration due to toxicity, or to organ 
dysfunctions. Conversely, the attending physician 
would also know to adjust treatment to higher levels 
if the clinical response were not adequate (precluding 
toxicity) . The magnitude of an administrated dose in 
5 the management of the oncogenic disorder of interest 
will vary with the severity of the condition to be 
treated and the route of administration. The dose and 
perhaps dose frequency, will also vary according to 
the age, body weight, and response of the individual 

10 patient. A program comparable to that discussed above 
may be used in veterinary medicine. 

Use of pharmaceutically acceptable carriers to 
formulate the compounds herein disclosed for the 
practice of the invention into dosages suitable for 
systemic administration is within the scope of the 
invention. With proper choice of carrier and suitable 
manufacturing practice, the compositions of the 
present invention, in particular, those formulated as 
solutions, may be administered parenterally, such as 
by intravenous injection. The compounds can be 

20 formulated readily using pharmaceutically acceptable 
carriers well known in the art into dosages suitable 
for oral administration. Such carriers enable the 
compounds of the invention to be formulated as 
tablets, pills, capsules, liquids, gels, syrups, 

25 slurries, suspensions and the like, for oral ingestion 
by a patient to be treated. 

Pharmaceutical compositions suitable for use in 
the present invention include compositions wherein the 
active ingredients are contained in an effective 
amount to achieve its intended purpose. Determination 
of the effective amounts is well within the capability 
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15 



of those skilled in the art, especially in light of 
the detailed disclosure provided herein. 

In addition to the active ingredients, these 
pharmaceutical compositions may contain suitable 
pharmaceutically acceptable carriers comprising 
5 excipients and auxiliaries which facilitate processing 
of the active compounds into preparations which can be 
used pharmaceutically. The preparations formulated 
for oral administration may be in the form of tablets, 
dragees, capsules, or solutions. 
10 The pharmaceutical compositions of the present 

invention may be manufactured in a manner that is 
itself known, e.g. . by means of conventional mixing, 
dissolving , granulat ing , dragee -making , levigating , 
emulsifying, encapsulating, entrapping or lyophilizing 
processes . 

Pharmaceutical formulations for parenteral * 
administration include aqueous solutions of the active 
compounds in water-soluble form. Additionally, 
suspensions of the active compounds may be prepared as 
appropriate oily injection suspensions. Suitable 
lipophilic solvents or vehicles include fatty oils 
such as sesame oil, or synthetic fatty acid esters, 
such as ethyl oleate or triglycerides, or liposomes. 
Aqueous injection suspensions may contain substances 
which increase the viscosity of the suspension, such 
25 as sodium carboxymethyl cellulose, sorbitol, or 

dextran. Optionally, the suspension may also contain 
suitable stabilizers or agents which increase the 
solubility of the compounds to allow for the 
preparation of highly concentrated solutions. 

Pharmaceutical preparations for oral use can be 
obtained by combining the active compounds with solid 
excipient, optionally grinding a resulting mixture, 
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and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or 
dragee cores. Suitable excipients are, in particular, 
fillers such as sugars, including lactose, sucrose, 
mannitol, or sorbitol; cellulose preparations such as, 
5 for example, maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl 
cellulose , hydroxypropylmethyl- cellulose , sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone 
(PVP) . If desired, disintegrating agents may be 
!0 added, such as the cross-linked polyvinyl pyrrolidone, 
agar, or alginic acid or a salt thereof such as sodium 
alginate . 

Dragee cores are provided with suitable coatings. 
For this purpose, concentrated sugar solutions may be 

^ used, which may optionally contain gum arabic, talc, 
polyvinyl pyrrolidone, carbopol gel, polyethylene 
glycol, and/or titanium dioxide, lacquer solutions, 
and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or 
dragee coatings for identification or to characterize 

20 different combinations of active compound doses. 

Pharmaceutical preparations which can be used 
orally include push- fit capsules made of gelatin, as 
well as soft, sealed capsules made of gelatin and a 
plasticizer, such as glycerol or sorbitol. The 

25 push- fit capsules can contain the active ingredients 
in admixture with filler such as lactose, binders such 
as starches, and/or lubricants such as talc or 
magnesium stearate and, optionally, stabilizers. In 
soft capsules, the active compounds may be dissolved 

^ or suspended in suitable liquids, such as fatty oils, 
liquid paraffin, or liquid polyethylene glycols. In 
addition, stabilizers may be added. 
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6. EXAMPLE: DP178 (SEQ ID:1) IS A POTENT 
INHIBITOR OF HIV-1 INFECTION 

In this example, DP178 (SEQ ID:1) is shown to be 

a potent inhibitor of HIV-1 mediated CD-4* cell -cell 

fusion and infection by cell free virus. In the 

5 fusion assay, this peptide completely blocks virus 

induced syncytia formation at concentrations of from 

1-10 ng/ml. In the infectivity assay the inhibitory 

concentration is somewhat higher, blocking infection 

at 90ng/ml. It is further shown that DP178 (SEQ ID:1) 

shows that the antiviral activity of DP178 (SEQ ID:1) 

.0 

is highly specific for HIV-1. Additionally, a 
synthetic peptide, DP-185 (SEQ ID:3), representing a 
HIV- 1- derived DP178 homolog is also found to block 
HIV-l-mediated syncytia formation. 

6.1- MATERIALS AND METHODS 

6.1.1. PEPTIDE SYNTHESIS 
Peptides were synthesized using Fast Moc 
chemistry on an Applied Biosystems Model 43 1A peptide 
synthesizer. Generally, unless otherwise noted, the 
peptides contained amidated carboxy termini and 
acetylated amino termini. Amidated peptides were 
prepared using Rink resin (Advanced Chemtech) while 
peptides containing free carboxy termini were 
synthesized on Wang (p-alkoxy-benzyl -alcohol) resin 
(Bachem) . First residues were double coupled to the 
appropriate resin and subsequent residues were single 
coupled. Each coupling step was followed by acetic 
anhydride capping. Peptides were cleaved from the 
resin by treatment with trifluoracetic acid (TFA) 
(10ml), H 2 0 (0.5ml), thioanisole (0.5ml), ethanedithiol 
(0.25ml), and crystalline phenol (0.75g). Purifi- 
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cation was carried out by reverse phase HPLC. 
Approximately 50mg samples of crude peptide were 
chromatographed on a Waters Delta. Pak C18 column (19mm 
x 3 0cm, 15/i spherical) with a linear gradient; 
H 2 0/acetonitrile 0.1% TFA. Lyophilized peptides were 4 
5 stored desiccated and peptide solutions were made in 
water at about img/ml. Electrospray mass spectrometry 
yielded the following results: DP178 {SEQ 
ID:1) :4491.87 (calculated 4491 . 94 ) ; DP-180 (SEQ 
ID:2) :4491.45 (calculated 4491 . 94 ) ; DP-185 (SEQ 
10 ID: 3) mot done (calculated 4546.97). 



15 
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6.1.2. VIRUS 
The HIV-Ijaj virus was obtained from R. Gallo 
(Popovic, M. et al. , 1984, Science 224:497-508) and 
propagated in CEM cells cultured in RPMI 1640 
5 containing 10% fetal calf serum. Supernatant from the 
infected CEM cells was passed through a 0.2/im filter 
and the infectious titer estimated in a 
microinfectivity assay using the AA5 cell line to 
support virus replication. For this purpose, 25/xl of 

10 serial diluted virus was added to ISfil AA5 cells at a 
concentration of 2 x 10 5 /ml in a 96 -well microtitre 
plate. Each virus dilution was tested in triplicate. 
Cells were cultured for eight days by addition of 
fresh medium every other day. On day 8 post 

^ infection, supernatant samples were tested for virus 
replication as evidenced by reverse transcriptase 
activity released to the supernatant. The TCID 50 was 
calculated according to the Reed and Muench formula 
(Reed, L.J. et al . , 1938, Am. J. Hyg. 27:493-497). 
The titer of the HIV-l^ and HIV-1^ stocks used for 

20 these studies, as measured on the AA5 cell line, was 
approximately 1.4 x 10 6 and 3.8 x 10 4 TCID 50 /ml, 
respectively. 

6.1.3. CELL FUSION ASSAY 
Approximately 7 x 10 4 Molt cells were incubated 
25 with 1 x 10 4 CEM cells chronically infected with the 
HIV-l^ virus in 96-well plates (one-half area cluster 
plates; Costar, Cambridge, MA) in a final volume of 
100/zl culture medium as previously described 
(Matthews, T.J. et al . , 1987, Proc. Natl. Acad. Sci. 
USA M: 5424-5428) . Peptide inhibitors were added in 

30 

a volume of lO/il and the cell mixtures were incubated 
for 24 hr. at 37°C. At that time, multinucleated 
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giant cells were estimated by microscopic examination 
at a 4 Ox magnification which allowed visualization of 
the entire well in a single field. 

6.1.4. CELL FREE VIRUS INFECTION ASSAY 
5 Synthetic peptides were incubated at 37°C with 

either 247 TCID S0 (for experiment depicted in FIG. 2), 
or 62 TCID 50 (for experiment depicted in FIG. 3) units 
of HIV-Ijax virus or 25 TCID S0 units of HIV-2 NIHZ and CEM 
CD4 + cells at peptide concentrations of 0, 0.04, 0.4, 
10 4.0, and 40/^g/ml for 7 days. The resulting reverse 
transcriptase (RT) activity in counts per minute was 
determined using the assay described, below, in 
Section 6.1.5. See, Reed, L.J. et al . , 1938, Am. J. 
Hyg. 27: 493-497 for an explanation of TCID 50 
calculations . 

15 

6.1.5. REVERSE TRANSCRIPTASE ASSAY 
The micro-reverse transcriptase (RT) assay was 
adapted from Goff et al . (Goff, S. et al . . 1981, J. 
Virol. 28:239-248) and Willey et al . (Willey, R. et 

20 al., 1988, J. Virol. 62:139-147). Supernatants from 
virus/cell cultures are adjusted to 1% Triton-XlOO. A 
10/zl sample of supernatant was added to 50/zl of RT 
cocktail in a 96-well U-bottom microtitre plate and 
the samples incubated at 3 7°C for 90 min. The RT 

25 cocktail contained 75mM KC1, 2mM dithiothreitol, 5mM 
MgCl 2 , 5/zg/ml poly A (Pharmacia, cat. No. 27-4110-01), 
0.25 units/ml oligo dT (Pharmacia, cat. No. 27-7858- 
01), 0.05% NP40, 50mM Tris-HCl, pH 7.8, 0.5f*M non- 
radioactive dTTP, and 10/zCi/ml 32 P-dTTP (Amersham, cat. 
No. PB. 10167) . 

30 

After the incubation period, 40/il of reaction 
mixture was applied to a Schleicher and Schuell (S+S) 
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NA45 membrane (or DE81 paper) saturated in 2 x SSC 
buffer (0.3M NaCl and 0.003M sodium citrate) held in a 
S+S Minifold over one sheet of GB003 (S+S) filter 
paper, with partial vacuum applied. Each well of the 
minifold was washed four times with 200/zl 2xSSC, under 
5 full vacuum. The membrane was removed from the 

minifold and washed 2 more times in a pyrex dish with 
an excess of 2xSSC. Finally, the membrane was drained 
on absorbent paper, placed on Whatman #3 paper, 
covered with Saran wrap, and exposed to film overnight 
10 at -70°C. 

6.2. RESULTS 

6.2.1. PEPTIDE INHIBITION OF INFECTED CELL- 

INDUCED SYNCYTIA FORMATION " * 

The initial screen for antiviral activity assayed 

15 peptides 1 ability to block syncytium formation induced 

by overnight co-cultivation of uninfected Molt4 cells 

with chronically HIV-1 infected CEM cells. The 

results of several such experiments are presented 

herein. In the first of these experiments, serial 

20 DP178 (SEQ ID:1) peptide concentrations between 

10/ig/ml and 12.5ng/ml were tested for blockade of the 

cell fusion process. For these experiments, CEM cells 

chronically infected with either HIV-1^, HIV-1™, HIV- 

l RF , or HIV-l SF2 virus were cocultivated overnight with 

uninfected Molt 4 cells. The results (FIG. 4) show 

25 

that DP178 (SEQ ID:1) afforded complete protection 
against each of the HIV-1 isolates down to the lowest 
concentration of DP178 (SEQ ID;1) used. For HIV^ 
inhibition, the lowest concentration tested was 
I2.5ng/ml; for all other HIV-l viruses, the lowest 
30 concentration of DP178 (SEQ ID:1) used in this study 
was lOOng/ml. A second peptide, DP-180 (SEQ ID:2) , 
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containing the same amino acid residues as DP178 (SEQ 
ID:1) but arranged in a random order exhibited no 
evidence of anti-f usogenic activity even at the high 
concentration of 40jig/ml (FIG. 4) . These observations 
indicate that the inhibitory effect of DP178 (SEQ 
5 ID:1) is primary sequence-specific and not related to 
non-specific peptide/protein interactions. The actual 
endpoint ( i.e. , the lowest effective inhibitory 
concentration) of DP178 inhibitory action is within 
the range of 1-10 ng/ml. 
10 The next series of experiments involved the 

preparation and testing of a DP178 (SEQ ID:1) homolog 
for its ability to inhibit HIV- 1- induced syncytia 
formation. As shown in FIG. 1, the sequence of DP- 18 5 
(SEQ ID: 3) is slightly different from DP178 (SEQ ID:1) 
in that its primary sequence is taken from the HIV-1 SF2 

15 

isolate and contains several amino acid differences 
relative to DP178 (SEQ ID:1) near the N terminus. As 
shown in FIG. 4, DP-185 (SEQ ID:3) , exhibits 
inhibitory activity even at 312.5ng/ml, the lowest 
concentration tested. 

20 The next series of experiments involved a 

comparison of DP178 (SEQ ID:1) HIV-1 and HIV-2 
inhibitory activity. As shown in FIG. 5, DP178 (SEQ 
ID:1) blocked HIV-l-mediated syncytia formation at 
peptide concentrations below Ing/ml. DP178 (SEQ ID:1) 

25 failed, however, to block HIV-2 mediated syncytia 

formation at concentrations as high as 10/ig/ml. This 
striking 4 log selectivity of DP178 (SEQ ID:1) as an 
inhibitor of HIV-l-mediated cell fusion demonstrates 
an unexpected HIV-1 specificity in the action of DP178 
(SEQ ID:1) . DP178 (SEQ ID:1) inhibition of HIV-1 - 

30 

mediated cell fusion, but the peptide's inability to 
inhibit HIV-2 medicated cell fusion in the same cell 
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type at the concentrations tested provides further 
evidence for the high degree of selectivity associated 
with the antiviral action of DP178 (SEQ ID:1) . 

6.2.2. PEPTIDE INHIBITION OF INFECTION BY 
5 CELL -FREE VIRUS 

DP178 (SEQ ID:1) was next tested for its ability 

to block CD-4 + CEM cell infection by cell free HIV-1 

virus. The results, shown in FIG. 2, are from an 

experiment in which DP178 (SEQ ID:1) was assayed for 

its ability to block infection of CEM cells by an 

10 

HIV-l^j isolate. Included in the experiment were 
three control peptides, DP-116 (SEQ ID:9) , DP-125 (SEQ 
ID:8), and DP- 118 (SEQID:10). DP-116 (SEQID:9) 
represents a peptide previously shown to be inactive' * 
using this assay, and DP-125 (SEQ ID: 8; Wild, C. et 

15 al., 1992, Proc. Natl. Acad, Sci. USA .89 : 10 , 53 7) and 
DP- 118 (SEQ ID: 10) are peptides which have previously 
been shown to be active in this assay. Each 
concentration (0, 0.04, 0.4, 4, and 40/ig/ml) of 
peptide was incubated with 247 TCID S0 units of HIV-l^j 

2 0 virus and CEM cells. After 7 days of culture, cell- 
free supernatant was tested for the presence of RT 
activity as a measure of successful infection. The 
results, shown in FIG. 2, demonstrate that DP178 (SEQ 
ID:1) inhibited the de novo infection process mediated 
by the HIV-l viral isolate at concentrations as low as 

25 

90ng/ml (IC50:=90ng/ml) . In contrast, the two positive 
control peptides, DP-125 (SEQ: ID:8) and DP-118 (SEQ 
ID:10), had over 60-fold higher IC50 concentrations of 
approximately 5//g/ml. 

In a separate experiment, the HIV-1 and HIV- 2 
30 inhibitory action of DP178 (SEQ ID:1) was tested with 
CEM cells and either HIV-1™ or HIV-2 MIHZ . 62 TCID S0 



- 120 - 



WO 01/51673 



PCT/US00/35727 



HIV-l^j or 25 GCID 50 HIV-2 NIH2 were used in these 
experiments , and were incubated for 7 days , As may be 
seen in FIG. 3, DP178 (SEQ ID:1) inhibited HIV-1 
infection with an IC50 of about 3lng/ml. In contrast , 
DP178 (SEQ ID:1) exhibited a much higher IC50 for HIV- 
5 2 nihz' thus making DP178 (SEQ ID:1) two logs more potent 
as a HIV-l inhibitor than a HIV-2 inhibitor. This 
finding is consistent with the results of the fusion 
inhibition assays described, above, in Section 6.2.1, 
and further supports a significant level of 
10 selectivity ( i.e. , for HIV-1 over HIV-2) . 

7. EXAMPLE: THE HIV-1 INHIBITOR, 

DPI 7 8 (SEP ID:1) IS NON- CYTOTOXIC 

In this Example, the 36 amino acid synthetic 

peptide inhibitor DP178 (SEQ ID:1) is shown to be non- 

15 cytotoxic to cells in culture," even at the highest 

peptide concentrations (40/^g/ml) tested. 

7.1. MATERIALS AND METHODS 
Cell proliferation and toxicity assay: 
20 Approximately 3.8x10 s CEM cells for each peptide 

concentration were incubated for 3 days at 3 7°C in T25 
flasks. Peptides tested were DP178 (SEQ ID:1) and DP- 
116 (SEQ ID:9) f as described in FIG. 1. Peptides were 
synthesized as described, above, in Section 6.1. The 
concentrations of each peptide used were 0, 2.5, 10, 
and 4 0,ug/ml. Cell counts were taken at incubation 
times of 0, 24, 48, and 72 hours. 

7.2 . RESULTS 
Whether the potent HIV-1 inhibitor DP178 (SEQ 
30 ID:l) exhibited any cytotoxic effects was assessed by 
assaying the peptide's effects on the proliferation 



- 121 - 



WO 01/51673 



PCT/USQQ/35727 



and viability of cells in culture. CEM cells were 
incubated in the presence of varying concentrations of 
DP178 {SEQ ID:1), and DP-116 (SEQ ID:9), a peptide 
previously shown to be ineffective as a HIV inhibitor 
(Wild, C. et al . , 1992, Proc. Natl. Acad. Sci. USA 
5 89:10,537-10,541). Additionally, cells were incubated 
in the absence of either peptide. 

The results of the cytotoxicity study demonstrate 
that DP178 (SEQ ID:1) exhibits no cytotoxic effects on 
cells in culture. As can be seen, below, in Table VI, 

10 even the proliferation and viability characteristics 
of cells cultured for 3 days in the presence of the 
highest concentration of DP178 (SEQ ID:1) tested 
(4 0/xg/ml) do not significantly differ from the DP-116 
(SEQ ID: 9) or the no-peptide controls. The cell 
proliferation data is also represented in graphic form 
in FIG. 6. As was demonstrated in the Working Example 
presented above in Section 6, DP178 (SEQ ID:1) 
completely inhibits HIV-l mediated syncytia formation 
at peptide concentrations between l and lOng/ml, and 
completely inhibits cell -free viral infection at 

20 concentrations of at least 90ng/ml. Thus, this study 
demonstrates that even at peptide concentrations 
greater than 3 log higher than the HIV inhibitory 
dose, DP178 (SEQ ID:1) exhibits no cytotoxic effects. 



25 
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Table VI 

% Viability 
at time (hours) 

Peptide 

5 Peptide Concentration uQ/ml 0 24 48 72 



DP178 40 98 97 95 97 

(SEQ 
ID:1) 

10 98 97 98 98 

10 2.5 98 93 96 96 



DP116 40 98 95 98 97 

(SEQ 



15 



ID: 9) 



10 98 95 93 98 

2.5 98 96 98 99 



No 0 98 97 99 98 

Peptide 



20 

8 . EXAMPLE: THE INTERACTION OF DP178 AND DP107 
Soluble recombinant forms of gp41 used in the 
example described below provide evidence that the 
DP178 peptide associates with a distal site on gp41 

25 whose interactive structure is influenced by the DP107 
leucine zipper motif. A single mutation disrupting 
the coiled-coil structure of the leucine zipper domain 
transformed the soluble recombinant gp41 protein from 
an inactive to an active inhibitor of HIV-1 fusion. 

30 This transformation may result from liberation of the 
potent DP178 domain from a molecular clasp with the 
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leucine zipper, DP107, determinant. The results also 
indicate that the anti-HIV activity of various gp4l 
derivatives (peptides and recombinant proteins) may be 
due to their ability to form complexes with viral gp41 
and interfere with its fusogenic process. 

5 

8.1. MATERIALS AND METHODS 

8.1.1. CONSTRUCTION OF FUSION PROTEINS 
AND GP41 MUTANTS 

Construction of fusion proteins and mutants shown 

10 

in FIG. 7 was accomplished as follows: the DNA 
sequence corresponding to the extracellular domain of 
gp41 (540-686) was cloned into the Xmn I site of the 
expression vector pMal-p2 (New England Biolab) to give 
M41. The gp4l sequence was amplified from pgtat 

15 (Malim et al., 1988, Nature 355: 181-183) by using 
polymerase chain reaction (PCR) with upstream primer 
5 1 -ATGACGCTGACGGTACAGGCC-3 • (primer A) and downstream 
primer 5 « -TGACTAAGCTTAATACCACAGCCAATTTGTTAT-3 1 (primer 
B) . M41-P was constructed by using the T7-Gen 

20 in vitro mutagenesis kit from United States 
Biochemicals (USB) following the supplier's 
instructions. The mutagenic primer (5»- 
GGAGCTGCTTGGGGCCCCAGAC-3 1 ) introduces an He to Pro 
mutation in M41 at position 578. M41A107, from which 
the DP- 107 region has been deleted, was made using a 
deletion mutagenic primer 5 1 - 

CCATSlATCCCCAGGAGCTGCTCGAGCTGCACTATACCAGAC-3 1 (primer C) 
following the USB T7-Gen mutagenesis protocol. 
M41A178, from which the DP -17 8 region has been 
deleted, was made by cloning the DNA fragment 
30 corresponding to gp41 amino acids 540-642 into the 
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Xmn I site of pMal-p2. Primer A and 5'- 
ATAGCTTCTAGATTAATTGTTAATTTCTCTGTCCC-3 ! (primer D) were 
used in the PCR with the template pgtat to generate 
the inserted DNA fragments. M41-P was used as the 
template with primer A and D in PCR to generate M41- 
5 PA178 . All inserted sequences and mutated residues 
were checked by restriction enzyme analysis and 
confirmed by DNA sequencing. 

8.1.2. PURIFICATION AND CHARACTERIZATION 
OF FUSION PROTEINS 

The fusion proteins were purified according to 

the protocol described in the manufacturers brochure 

of protein fusion and purification systems from New 

England Biolabs (NEB). Fusion proteins (10 ng) were*" 

analyzed by electrophoresis on 8% SDS polyacrylamide 

gels. Western blotting analysis was performed as 

described by Sambrook et al . , 1989, Molecular Cloning: 

A Laboratory Manual, 2d Ed, Cold Spring Harbor 

Laboratory Press-, Cold Spring Harbor, NY, Ch. 18, 

pp. 64-75. An HIV-1 positive serum diluted 1000-fold, 

or a human Fab derived from repertoire cloning was 

used to react with the fusion proteins. .The second 

antibody was HRP- conjugated goat antihuman Fab. An 

ECL Western blotting detection system (Amersham) was 

used to detect the bound antibody. A detailed 

protocol for this detection system was provided by the 

manufacturer. Rainbow molecular weight markers 

(Amersham) were used to estimate the size of fusion 

proteins. 

8.1.3. CELL FUSION ASSAYS FOR ANTI-HIV ACTIVITY 
Cell fusion assays were performed as previously 
described (Matthews et al., 1987, Proc: Natl. Acad. 
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Sci. USA 84: 5424-5481). CEM cells (7 X 10 4 ) were 
incubated with HIV-l IIIB chronically infected CEM cells 
(10 4 ) in 96-well flat -bottomed half-area plates 
(Costar) in 100 /zl culture medium. Peptide and fusion 
proteins at various concentrations in 10 /il culture 
5 medium were incubated with the cell mixtures at 37°C 
for 24 hours. Multinucleated syncytia were estimated 
with microscopic examination. Both M41 and M41-P did 
not show cytotoxicity at the concentrations tested and 
shown in FIG . 8 . 
10 Inhibition of HIV-1 induced cell-cell fusion 

activity was carried out in the presence of 10 nM 
DP178 and various concentrations of M41A178 or M41- 
PA178 as indicated in FIG. 9. There was no observable 
syncytia in the presence of 10 nM DP178. No peptide 
or fusion protein was added in the control samples. 

8.1.4. EL ISA ANALYSIS OF DP178 BINDING 

TO THE LEUCINE ZIPPER MOTIF OF GP41 

The amino acid sequence of DP178 used is: 

YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF. For enzyme 

20 linked immunoassay (ELISA) , M41A178 or M41-PA178 (5 
/xg/ml) in 0.1M NaHC0 3/ pH 8.6, were coated on 96 wells 
Linbro ELISA plates (Flow Lab, Inc.) overnight. Each 
well was washed three times with distilled water then 
blocked with 3% bovine serum albumin (BSA) for 2 
hours. After blocking, peptides with 0.5% BSA in TBST 
(40 mM Tris-HCl pH7.5, 150 mM NaCl, 0.05% Tween 20) 
were added to the ELISA plates and incubated at room 
temperature for 1 hour. After washing three times 
with TBST, Fab-d was added at a concentration of 10 
ng/ml with 0.5% BSA in TBST. The plates were washed 

30 three times with TBST after incubation at room 

temperature for l hour. Horse radish peroxidase (HRP) 
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conjugated goat antihuman Fab antiserum at a 2000 fold 
dilution in TBST with 0.5% BSA was added to each well 
and incubated at room temperature for 45 minutes. The 
plates were then washed four times with TBST. The 
peroxidase substrate o-phenylene diamine (2.5 mg/ml) 
5 and 0.15% H 2 0 2 were added to develop the color. The 
reaction was stopped with an equal volume of 4.5 N 
H 2 S0 4 after incubation at room temperature for 10 
minutes. The optical density of the stopped reaction 
mixture was measured with a micro plate reader 
!0 (Molecular Design) at 4 90 nm. Results are shown in 
FIG. 10. 

8.2. RESULTS 

8.2.1. THE EXPRESSION AND CHARACTERIZATION 
OF THE ECTODOMAIN OF gp41 

15 As a step toward understanding the roles of the 

two helical regions in gp4l structure and function, 
the ectodomain of gp41 was expressed as a maltose 
binding fusion protein (M41) (FIG. 7) . The fusogenic 
peptide sequence at the N- terminal of gp41 was omitted 

20 from this recombinant protein and its derivatives to 
improve solubility. The maltose binding protein 
facilitated purification of the fusion proteins under 
relatively mild, non-denaturing conditions. Because 
the M41 soluble recombinant gp41 was not glycosylated, 

^ lacked several regions of the transmembrane protein 
( i.e. , the fusion peptide, the membrane spanning, and 
the cytoplasmic domains) , and was expressed in the 
absence of gpl20, it was not expected to precisely 
reflect the structure of native gp41 on HIV-l virions. 
Nevertheless, purified M41 folded in a manner that 

30 preserved certain discontinuous epitopes as evidenced 
by reactivity with human monoclonal antibodies, 98-6, 
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126-6, and 50-69, previously shown to bind 
conformational epitopes on native gp41 expressed in 
eukaryotic cells (Xu et al., 1991, J. Virol. 65: 4832- 
4838; Chen, 1994, J. Virol. 68:2002-2010). Thus, at 
least certain regions of native gp41 defined by these 
5 antibodies appear to be reproduced in the recombinant 
fusion protein M41. Furthermore, M41 reacted with a 
human recombinant Fab (Fab-d) that recognizes a 
conformational epitope on gp41 and binds HIV-l virions 
as well as HIV-l infected cells but not uninfected 
10 cells as analyzed by FACS. Deletion of either helix 
motif, i.e. , DP107 or DP178, of the M41 fusion protein 
eliminated reactivity with Fab-d. These results 
indicate that both helical regions, separated by 60 
amino acids in the primary sequence, are required to 
maintain the Fab-d epitope. 

8.2.2. ANTI-HIV ACTIVITY OF THE 

RECOMBINANT ECTODOMAIN OF GP41 

The wild type M41 fusion protein was tested for 

anti-HIV-1 activity. As explained, supra . synthetic 

20 peptides corresponding to the leucine zipper (DP107) 

and the C- terminal putative helix (DP178) show potent 

anti-HIV activity. Despite inclusion of both these 

regions, the recombinant M41 protein did not affect 

HIV-l induced membrane fusion at concentrations as 

high as 50 (Table VII, below). 



30 
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Table VII 

DISRUPTION OF THE LEUCINE ZIPPER OF 
GP41 FREES THE ANTI-HIV MOTIF 



5 DP107 DPI 78 Mil M41-P M41-PA178 

Cell fusion 

(IC 90 ) 1 jiM 1 nM >50 \xU 83 nM >50 \iM 
Fab-D 

binding (k D ) - - 3.5xl0* 9 2.5xl0' 8 

HIV infectiv- 

ityaC^ 1 iiM 80 nM >16pM 66 nM >8 ^M 



1 The affinity constants of Fab-d binding to the fusion proteins were determined 
using a protocol described by B. Friguet et al., 1985, J. Immunol. Method. 
77:305-319. 

15 

- = No detectable binding of Fab-d to the fusion proteins. 

Antiviral Infectivity Assays. 20 \x\ of serially diluted virus stock was incubated 
for 60 minutes at ambient temperature with 20 ^1 of the indicated concentration 
of purified recombinant fusion protein in RPMI 1640 containing 10% fetal 
bovine serum and antibiotics in a 96-well microtiter plate. 20 ^1 of CEM4 cells 
at 6 x 10 5 cells/ml were added to each well, and cultures were incubated at 37 °C 

20 in a humidified C0 2 incubator. Cells were cultured for 9 days by the addition of 

fresh medium every 2 to 3 days. On days 5, 7, and 9 postinfection, supernatant 
samples were assayed for reverse transcriptase (RT) activity, as described 
below, to monitor viral replication. The 50% tissue culture infectious dose 
(TCID 50 ) was calculated for each condition according to the formula of Reed & 
Muench, 1937, Am. J. Hyg. 27:493-497. RT activity was determined by a 
modification of the published methods of Goff et al., 1981, J. Virol 38:239-248 
and Willey et al., 1988, J. Virol. 62:139-147 as described in Chen et al., 1993, 

2 5 AIDS Res. Human Retroviruses 9:1079-1086. 



Surprisingly, a single amino acid substitution, 
proline in place of isoleucine in the middle of the 
leucine zipper motif, yielded a fusion protein (M41-P) 
which did exhibit antiviral activity (Table XXV and 
Fig. 8). As seen in Table XXV, M41-P blocked syncytia 
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formation by 90% at approximately 85 nM and 
neutralized HIV-1 11IB infection by 90% at approximately 
70 nM concentrations. The anti-HIV-1 activity of M41- 
P appeared to be mediated by the C-terminal helical 
sequence since deletion of that region from M41-P 
5 yielded an inactive fusion protein, M41-PA178 (Table 
XXV) . This interpretation was reinforced by 
experiments demonstrating that a truncated fusion 
protein lacking the DP178 sequence, M41A178 , abrogated 
the potent ant i- fusion activity of the DP178 peptide 
10 in ^ concentration- dependent manner (FIG. 9) . The 
same truncated fusion protein containing the proline 
mutation disrupting the leucine zipper, M41-PA178, was 
not active in similar competition experiments (FIG. 
9) . The results indicate that the DP178 peptide 
associates with a second site on gp41 whose 

15 

interactive structure is dependent on a wild type 
leucine zipper sequence. A similar interaction may 
occur within the wild type fusion protein, M41, and 
act to form an intramolecular clasp which sequesters 
the DP178 region, making it unavailable for anti-viral 

20 activity. 

A specific association between these two domains 
is also indicated by other human monoclonal Fab-d 
studies. For example, Fab-d failed to bind either the 
DP178 peptide or the fusion protein M41A178, but its 

25 epitope was reconstituted by simply mixing these two 
reagents together (FIG. 10) . Again, the proline 
mutation in the leucine zipper domain of the fusion 
protein, M41-PA178, failed to reconstitute the epitope 
in similar mixing experiments. 
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9. EXAMPLE: METHOD FOR COMPUTER- ASSISTED 
IDENTIFICATION OF DP107-LIKE 
AND DP178-LIKE SEQUENCES 

A number of known coiled- coil sequences have been 
well described in the literature and contain heptad 
5 repeat positioning for each amino acid. Coiled-coil 
nomenclature labels each of seven amino acids of a 
heptad repeat A through G, with amino acids A and D 
tending to be hydrophobic positions. Amino acids E 
and G tend to be charged. These four positions (A, D, 
E, and G) form the amphipathic backbone structure of a 

10 

monomeric alpha-helix. The backbones of. two or more 
amphipathic helices interact with each other to form 
di-, tri-, tetrameric, etc., coiled-coil structures. 
In order to begin to design computer search motifs, a* 
series of well characterized coiled coils were chosen 

15 including yeast transcription- factor GCN4 , Influenza 
Virus hemagglutinin loop 36, and human proto- oncogenes 
c-Myc, c-Fos, and c-Jun. For each peptide sequence, a 
strict homology for the A and D positions, and a list 
of the amino acids which could be excluded for the B, 

20 C, E, F, and G positions {because they are not 

observed in these positions) was determined. Motifs 
were tailored to the DP107 and DP178 sequences by 
deducing the most likely possibilities for heptad 
positioning of the amino acids of HIV-1 Bru DP- 107, 
which is known to have coiled-coil structure, and HIV- 

25 

1 Bru DP178, which is still structurally undefined. 
The analysis of each of the sequences is contained in 
FIG. 12. For example, the motif for GCN4 was designed 
as follows: 

1. The only amino acids (using standard single 
30 letter amino acid codes) found in the A or D 

positions of GCN4 were [LMNV] . 
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2. All amino acids were found at B, C, E, F, and G 
positions except {CFGIMPTW}. 

3. The PES EAR CH motif would, therefore, be written 
as follows : 

[LMNV] - {CFGIMPTW } (2) - [LMNV] -{ CFGIMPTW} (3)- 
5 [LMNV] - {CFGIMPTW} (2) - [LMNV] - { CFGIMPTW} (3) - 

[LMNV] - {CFGIMPTW} (2) - [LMNV] - {CFGIMPTW } (3) - 
[LMNV] - {CFGIMPTW} (2) - [LMNV] -{CFGIMPTW} (3) 

Translating or reading the motif: "at the first A 
10 position either L, M, N, or V must occur; at positions 
B and C (the next two positions) accept everything 
except C, F, G, I, M, P, T, or W; at the D position 
either L, M, N, or V must occur; at positions E, F, 
and G (the next 3 positions) accept everything except 
^ C, F # G, I, M, P, T, or W." This statement is 

contained four times in a 28-mer motif and five. times 
in a 35-mer motif. The basic motif key then would be: 
[LMNV] -{CFGIMPTW} . The motif keys for the remaining 
well described coiled- coil sequences are summarized in 
FIG. 12. 

20 The motif design for DP107 and DP178 was slightly 

different than the 28-mer model sequences described 
above due to the fact that heptad repeat positions are 
not defined and the peptides are both longer than 2 8 
residues. FIG. 13 illustrates several possible 

25 sequence alignments for both DP107 and DP178 and also 
includes motif designs based on 28-mer, 35-mer, and 
full-length peptides. Notice that only slight 
differences occur in the motifs as the peptides are 
lengthened. Generally, lengthening the base peptide 

3Q results in a less stringent motif. This is very 

useful in broadening the possibilities for identifying 
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DP107-or DP-178-like primary amino acid sequences 
referred to in this document as "hits". 

In addition to making highly specific motifs for 
each type peptide sequence to be searched, it is also 
possible to make "hybrid" motifs. These motifs are 
5 made by "crossing" two or more very stringent motifs 
to make a new search algorithm which will find not 
only both "parent" motif sequences but also any 
peptide sequences which have similarities to one, the. 
other, or both "parents". For example, in FIG. 14 the 

10 "parent" sequence of GCN4 is crossed with each of the 
possible "parent" motifs of DP-107. Now' the hybrid 
motif must contain all of the amino acids found in the 
A and D positions of both parents, and exclude all of 
the amino acids not found in either parent at the 
other positions. The resulting hybrid from crossing 
GCN4 or [LMNV] {CFGIMPTW} and DPI 07 (28 -mer with the 
first L in the D position) or [ILQT] {CDFIMPST} , is 
■ [ILMNQTV] {CFIMPT} . Notice that now only two basic 
hybrid motifs exist which cover both framing 
possibilities, as well as all peptide lengths of the 

20 parent DP-107 molecule. FIG. 15 represents the 
"hybridizations" of GCN4 with DP-178. FIG. 16 
represents the "hybridizations" of DP107 and DP178. 
It is important to keep in mind that the represented 
motifs, both parent and hybrid, are motif keys and not 

25 the depiction of the full-length motif needed to 
actually do the computer search. 

Hybridizations can be performed on any 
combination of two or more motifs. FIG. 17 
summarizes several three-motif hybridizations 
including GCN4, DP107 (both frames), and DP178 (also 

30 

both frames) . Notice that the resulting motifs are 
now becoming much more similar to each other. In 
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fact, the first and third hybrid motifs are actually 
subsets of the second and fourth hybrid motifs 
respectively. This means that the first and third 
hybrid motifs are slightly more stringent than the 
second and fourth. It should also be noted that with 1 
5 only minor changes in these four motifs, or by 

hybridizing them, a single motif could be obtained 
which would find all of the sequences. However, it 
should be remembered that stringency is also reduced. 
Finally, the most broad- spectrum and least- stringent 

10 hybrid motif is described in FIG. 18 which summarizes 
the hybridization of GCN4, DP107 (both frames), DP178 
(both frames), c-Fos, c-Jun, c-Myc, and Flu loop 36. 

A special set of motifs was designed based on the 
fact that DP -17 8 is located only approximately ten 
amino acids upstream of the transmembrane spanning 
region of gp4l and just C-terminal to a proline, which 
separates DP107 and DP178. It has been postulated 
that DP178 may be an amphipathic helix when membrane 
associated, and that the proline might aid in the 
initiation of the helix formation. The same 

20 arrangement was observed in Respiratory Syncytial 
Virus; however, the DPl78-like region in this virus 
also had a leucine zipper just C-terminal to the 
proline. Therefore, N-terminal proline -leucine zipper 
motifs were designed to analyze whether any other 

25 viruses might contain this same pattern. The motifs 
are summarized in FIG. 19. 

The PC/Gene protein database contains 5879 viral 
amino acid sequences (library file PVIRUSES; CD-ROM 
release 11.0) . Of these, 1092 are viral enveloped or 
glycoprotein sequences (library file PVIRUSE1) . 
Tables V through XIV contain lists of protein sequence 
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names and motif hit locations for all the motifs 
searched. 

10. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION 
OF DP107 AND DP178-LIKE SEQUENCES 
IN HUMAN IMMUNODEFICIENCY VIRUS 

5 

FIG. 20 represents search results for HIV-1 BRU 
isolate gp41 (PC/Gene protein sequence PENV_HV1BR) . 
Notice that the hybrid motif which crosses DP-107 and 
DP- 178 (named 107x178x4; the same motif as found in 
FIG. 16 found three hits including amino acids 550- 

10 

599, 636-688, and 796-823. These areas include DP-107 
plus eight N- terminal and four C-terminal amino acids; 
DP178 plus seven N- terminal and ten C-terminal amino 
acids; and an area inside the transmembrane region 
(cytoplasmic) . FIG. 20 also contains the results 

15 obtained from searching with the motif named ALLM0TI5, 
for which the key is found in FIG. 17 ( { CDGHP } 
{CFP}x5) . This motif also found three hits including 
DP107 (amino acids 510-599), DP178 (615-717), and a 
cytoplasmic region (772-841) . These hits overlap the 

20 hits found by the motif 107x178x4 with considerable 
additional sequences on both the amino and carboxy 
termini. This is not surprising in that 107x178x4 is 
a subset of the ALLMOTI5 hybrid motif. Importantly, 
even though the stringency of ALLM0TI5 is considerably 
less than 107x178x4, it still selectively identifies 

2 5 

the DP107 and DP178 regions of gp41 shown to contain 
sequences for inhibitory peptides of HIV-1. The 
results of these two motif searches are summarized in 
Table V of U.S. Patent Application Serial No. 
08/470,896 filed on June 6, 1995 (incorporated herein 
30 by reference in its entirety) under the PC/Gene 

protein sequence name PENV_HV1BR. The proline-leucine 
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zipper motifs also gave several hits in HIV-1 BRU 
including 503-525 which is at the very C- terminus of 
gpl20, just upstream of the cleavage site (P7LZIPC and 
P12LZIPC) ; and 735-768 in the cytoplasmic domain of 
gp4l (P23LZIPC) . These results are found in Tables 
5 VIII, IX, and X under the same sequence name as 

mentioned above. Notice that the only area of HIV-1 
BRU which is predicted by the Lupas algorithm to 
contain a coiled-coil region, is from amino acids 635- 
670. This begins eight amino acids N-terminal to the 
lo start and ends eight amino acids N-terminal to the end 
of DP178. DP107, despite the fact that it is a known 
coiled coil, is not predicted to contain a coiled-coil 
region using the Lupas method. 

11. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION 
15 OF DP107-LIKE AND DP178-LIKE 

SEQUENCES IN HUMAN RESPIRATORY 
SYNCYTIAL VIRUS 

FIG. 21 represents search results for Human 

Respiratory Syncytial Virus (RSV; Strain A2) fusion 

glycoprotein Fl (PC/Gene protein sequence name PVGLF 
20 ~ 
HRSVA) . Motif 107x178x4 finds three hits including 

amino acids 152-202, 213-243, and 488-515. The 

arrangement of these hits is similar to what is found 

in HIV-1 except that the motif finds two regions with 

similarities to DP- 178, one just downstream of what 

25 would be called the DP107 region or amino acids 213- 
243, and one just upstream of the transmembrane region 
(also similar to DP178) or amino acids 488-515. Motif 
ALLM0TI5 also finds three areas including amino acids 
116-202, 267-302, and 506-549. The proline -leucine 

30 zipper motifs also gave several hits including amino 
acids 205-221 and 265-287 (P1LZIPC 265-280, P12LZIPC) , 
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and 484-513 (P7LZIPC and P12LZIPC 484-506, P23LZIPC) . 
Notice that the PLZIP motifs also identify regions 
which share location similarities with DP- 178 of HIV- 
1. 

5 12. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 
DP107-LIKE AND DP178-LIKE SEQUENCES 
IN SIMIAN IMMUNODEFICIENCY VIRUS 

Motif hits for Simian immunodeficiency Virus gp41 
(AGM3 isolate; PC/Gene protein sequence name 
PENV_SIVAG) are shown in FIG. 22. Motif 107x178x4 
finds three hits including amino acids 566-593, 597- 
624, and 703-730. The first two hits only have three 
amino acids between them and could probably be 
combined into one hit from 566-624 which would 
represent a DP107-like hit. Amino acids 703 to 730 
would then represent a DP178-like hit. ALLMOTI5 also 
finds three hits including amino acids 556-628 (DP107- 
like) , 651-699 (DP178-like) , and 808-852 which 
represents the transmembrane spanning region. SIV 
also has one region from 655-692 with a high 
propensity to form a coiled coil as predicted by the 
Lupas algorithm. Both 107x178x4 and ALLMOTI5 motifs 
find the same region. SIV does not have any PLZIP 
motif hits in gp41. 

The identification of DP178/DP107 analogs for a 
second SIV isolate (MM251) is demonstrated in the 
Example presented, below, in Section 19. 

13. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 
DP107-LIKE AND DP178 LIKE SEQUENCES 
IN CANINE DISTEMPER VIRUS 

30 Canine Distemper Virus (strain Onderstepoort) 

fusion glycoprotein Fl (PC/Gene Protein sequence name 

- 137 - 



10 



15 



20 



25 



WO 01/51673 



PCT/USOO/35727 



PVGLF_CDVO) has regions similar to Human RSV which are 
predicted to be DP107-like and DP178-like (FIG. 23) . 
Motif 107x178x4 highlights one area just C-terminal to 
ttie fusion peptide at amino acids 252-293. Amino 
acids 252-286 are also predicted to be coiled coil 
5 using the Lupas algorithm. Almost 100 amino acids C- 
terminal to the first region is a DP178-like area at 
residues 340-367. ALLMOTI5 highlights three areas of 
interest including: amino acids 228-297, which 
completely overlaps both the Lupas prediction and the 

10 DP107-like 107x178x4 hit; residues 340-381, which 
overlaps the second 107x178x4 hit; and amino acids 
568-602, which is DP178-like in that it is located 
just N-terminal to the transmembrane region. It also 
overlaps another region (residues 570-602) predicted 

^ by the Lupas method to have a high propensity to form 
a coiled coil. Several PLZIP motifs successfully 
identified areas of interest including P6 and P12LZIPC 
which highlight residues 336-357 and 336-361 
respectively; PI and P12LZIPC which find residues 398- 
414; and P12 and P23LZIPC which find residues 562-589 

20 and 562-592 respectively. 



14. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 
DP107-LIKE AND DP178-LIKE SEQUENCES 
IN NEWCASTLE DISEASE VIRUS 

FIG. 24 shows the motif hits found in Newcastle 
Disease Virus (strain Australia-Victoria/32 ; PC Gene 
protein sequence name PVGLF_NDVA) . Motif 107x178x4 
finds two areas including a DPl07-like hit at amino 
acids 151-178 and a DPl78-like hit at residues 426- 
512. ALLM0TI5 finds three areas including residues 
117-182, 231-272, and 426-512. The hits from 426-512 
include a region which is predicted by the Lupas 
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method to have a high coiled- coil propensity (460- 
503) . The PLZIP motifs identify only one region of 
interest at amino acids 273-289 (PI and 12LZIPC) . 

15. EXAMPLE: COMPUTER- ASSISTED IDENTIFICATION 
OF DP107-LIKE AND DP178-LIKE 
SEQUENCES IN HUMAN PARAINFLUENZA VIRUS 

Both motifs 107x178x4 and ALLMOTI5 exhibit 
DP107-like hits in the same region, 115-182 and 117- 
182 respectively, of Human Parainfluenza Virus (strain 
NIH 47885; PC/Gene protein sequence name PVGLF_pl3H4; 
(FIG. 25) . In addition, the two motifs .have a DP178- 
like hit just slightly C-terminal at amino acids 207- 
241. Both motifs also have DPl78-like hits nearer the 
transmembrane region including amino acids 457-4 97 and 
462-512 respectively. Several PLZIP motif hits are 
also observed including 283-303 (P5LZIPC) , 283-310 
(P12LZIPC), 453-474 (P6LZIPC) , and 453-481 (P23LZIPC) . 
The Lupas algorithm predicts that amino acids 122-176 
may have a propensity to form a coiled-coil. 

20 16. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 

DP107-LIKE AND DP178-LIKE SEQUENCES OF 
INFLUENZA A VIRUS 

FIG. 2 6 illustrates the Lupas prediction for a 
coiled coil in Influenza A Virus (strain A/Aichi/2/68) 
at residues 379-436, as well as the motif hits for 

25 107x178x4 at amino acids 387-453, and for ALLMOTI5 at 
residues 380-456. Residues 383-471 (38-125 of HA2) 
were shown by Carr and Kim to be an extended coiled 
coil when under acidic pH (Carr and Kim, 1993, Cell 
73: 823-832). The Lupas algorithm predicts a coiled- 

30 coil at residues 379-436. All three methods 

successfully predicted the region shown to actually 



10 



15 
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have coiled-coil structure; however, ALLMOTI5 
predicted the greatest portion of the 88 residue 
stretch. 

17. EXAMPLE: POTENTIAL RESPIRATORY SYNCYTIAL VIRUS 
5 DP178/DP107 ANALOGS: CD AND 

ANTIVIRAL CHARACTER 1 2 AT I ON 

In the Example presented herein, respiratory 

syncytial virus (RSV) peptides identified by utilizing 

the computer-assisted search motifs described in the 

Examples presented in Sections 9 and 11, above/ were 

10 tested for ant i -RSV activity. Additionally, circular 
dichroism (CD) structural analyses were conducted on 
the peptides, as discussed below. It is demonstrated 
that several of the identified peptides exhibit potent 
antiviral capability. Additionally, it is shown that 

15 several of these peptides exhibit a substantial 
helical character. 



17.1 MATERIALS AND METHODS 
Structural analyses : The CD spectra were 
measured in a lOmM sodium phosphate, 150mM sodium 

20 

chloride, pH 7.0, buffer at approximately lOmM 
concentrations, using a 1 cm pathlength cell on a 
Jobin/Yvon Autodichrograph Mark V CD 
spectrophotometer. Peptides were synthesized 
according to the methods described, above, in Section 

25 6.1. Peptide concentrations were determined from A^o 
using Edlehoch's method (1967, Biochemistry 6:1948) . 

Anti-RSV antiviral activity assays : The assay 
utilized herein tested the ability of the peptides to 
disrupt the ability of HEp2 cells acutely infected 

30 with RSV ( i.e. , cells which are infected with a 

multiplicity of infection of greater than 2) to fuse 
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and cause syncytial formation on a monolayer of 
uninfected an uninfected line of Hep-2 cells. The 
lower the observed level of fusion, the greater the 
antiviral activity of the peptide was determined to 
be. 

5 Uninfected confluent monolayers of Hep-2 cells 

were grown in microtiter wells in 3% EMEM (Eagle 
Minimum Essential Medium w/o L-glut amine [Bio 
Whittaker Cat. No. 12-125F] , with fetal bovine serum 
[FBS; which had been heat inactivated for 30 minutes 
10 at 56°C; Bio Whittaker Cat. No. 14 -5 OIF) supplemented 
at 3%, antibiotics (penicillin/streptomycin; Bio 
Whittaker Cat. No. 17-602E) added at 1%, and glutamine 
added at 1%. 

To prepare Hep2 cells for addition to uninfected 
cells, cultures of acutely infected Hep2 cells were 

15 

washed with DPBS (Dulbecco 1 s" Phosphate Buffered Saline 
w/o calcium or magnesium; Bio Whittaker Cat. No. 17- 
512F) and cell monolayers were removed with Versene 
(1:5000; Gibco Life Technologies Cat. No. 15040-017). 
The cells were spun 10 minutes and resuspended in 3% 

20 FBS. Cell counts were performed using a 

hemacytometer. Persistent cells were added to the. 
uninfected Hep-2 cells. 

The antiviral assay was conducted by, first, 
removing all media from the wells containing 

25 uninfected Hep-2 cells, then adding peptides (at the 
dilutions described below) in 3% EMEM, and 100 acutely 
RSV-infected Hep2 cells per well. Wells were then 
incubated at 37°C for 48 hours. 

After incubation, cells in control wells were 

3q checked for fusion centers, media was removed from the 
wells, followed by addition, to each well, of either 
Crystal Violet stain or XTT. With respect to Crystal 
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Violet, approximately 50/zl 0.25% Crystal Violet stain 
in methanol were added to each well. The wells were 
rinsed immediately, to remove excess stain, and were 
allowed to dry. The number of syncytia per well were 
then counted, using a dissecting microscope. 
5 With respect to XTT (2 , 3-bis [2-Methoxy-4-nitro-5- 

sulfophenyl] -2H-tetrazolium-5-carboxyanilide inner 
salt), 50/il XTT (Img/ml in RPMI buffered with lOOmM 
HEPES, pH 7.2-7.4, plus 5% DMSO) were added to each 
well. The OD 450/690 was measured (after blanking against 
10 growth medium without cells or reagents, and against 
reagents) according to standard procedures. 

Peptides : The peptides characterized in the 
study presented herein were: 

1) peptides T-142 to T-155 and T-575, as shown in FIG. 
27A, and peptides T-22 to T-27, T-68, T-334 and T-371 
to T-3 75 and T-575, as shown in FIG. 27B; 

2) peptides T-120 to T-141 and T-576, as shown in FIG. 
27B, and peptides T-12, T-13, T-15, T-19, T-28 to T- 
30, T-66, T-69, T-70 and T-576, as shown in FIG. 27D; 
and 

20 3) peptides T-67 and T-104 to T-119 and T-384, as 

shown in FIG. 28A, and peptides T-71, T-613 to T-617, 
T-662 to T-676 and T-730, as shown in FIG. 28B. 

The peptides of group l represent portions of the 
RSV F2 protein DP178/l07-like region. The peptides of 

25 group 2 represent portions of the RSV Fl protein 

DP107-like region. The peptides of groups 3 represent 
portions of the RSV Fl protein DP178-like region. 

Each peptide was tested at 2 -fold serial 
dilutions ranging from 100/zg/ml to approximately 

3q lOOng/ml. For each of the assays, a well containing 
no peptide was also used. The IC 50 data for each 
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peptide represents the average of several experiments 
conducted utilizing that peptide. 

17.2 RESULTS 
The data summarized in FIGS. 27A-B and 28A-B 
5 represent antiviral and structural information 
obtained from peptides derived from the RSV F2 
DP178/DP107-like F2 region (FIG. 27A-B) , the RSV Fl 
DP-107-like region (FIG. 27C-D) and the RSV DP178-like 
F2 region (FIG. 28A-B) . 
10 As shown in FIGS. 27A-D, a number of the RSV 

DP178/DP107-like peptides exhibited a detectable level 
of antiviral activity. Peptides from the RSV 
DP178/DP107-like F2 region (FIG. 27A-B) , for example, 
T-142 to T-145 and T-334 purfied peptides, exhibited 
detectable levels of antiviral activity, as evidenced 
by their IC 50 values. Further, a number of RSV Fl 
DP107-like peptides (FIG. 27C-D) exhibited a sizable 
level of antiviral activity as purified peptides, 
including, for example, peptides T-124 to T-127, T- 
131, T-135 and T-137 to T-139, as demonstrated by 
20 their low IC S0 values. In addition, CD analysis FIG. 
27A, 27C) reveals that many of the peptides exhibit 
some detectable level of helical structure. 

The results summarized in FIG. 28A-B demonstrate 
that a number of DP178-like purified peptides exhibit 
25 a range of potent anti-viral activity. These peptides 
include, for example, T-67, T-104, T-105 and T-107 to 
T-119, as listed in FIG. 28A, and T-665 to T-669 and 
T-671 to T-673, as listed in FIG. 28B. In addition, 
some of the DP178-like peptides exhibited some level 
of helicity. 

Thus, the computer assisted searches described, 
hereinabove, successfully identified viral peptide 
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domains that represent highly promising anti-RSV 
antiviral compounds. 



18. EXAMPLE: POTENTIAL HUMAN PARAINFLUENZA VIRUS 
TYPE 3 DP178/DP107 ANALOGS: CD AND 
ANTIVIRAL CHARACTERIZATION 

In the Example presented herein, human 
parainfluenza virus type 3 (HPIV3) peptides identified 
by utilizing the computer-assisted search motifs 
described in the Examples presented in Sections 9 and 
15 , above, were tested for anti-HPIV3 activity. 
Additionally, circular dichroism (CD) structural 
analyses were conducted on the peptides, as discussed 
below. It is demonstrated that several of the 
identified peptides exhibit potent antiviral 
capability. Additionally, it is shown that several of 
these peptides exhibit a substantial helical 
character. 



18.1 MATERIALS AND METHODS 
Structural analyses : Structural analyses 

20 consisted of circular dichroism (CD) studies. The CD 
spectra were measured in a lOmM sodium phosphate, 
150mM sodium chloride, pH 7.0, buffer at approximately 
lOmM concentrations, using a 1 cm pathlength cell on a 
Jobin/Yvon Autodichrograph Mark V CD 
spectrophotometer. Peptide concentrations were 
determined from A 280 using Edlehoch's method (1967, 
Biochemistry 6:1948) . 

Anti-HPIV3 antiviral activity assays : The assay 
utilized herein tested the ability of the peptides to 
disrupt the ability of Hep2 cells chronically infected 

30 with HPIV3 to fuse and cause syncytial formation on a 
monolayer of an uninfected line of CV-1W cells. The 
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more potent the lower the observed level of fusion, 
the greater the antiviral activity of the peptide. 

Uninfected confluent monolayers of CV-1W cells 
were grown in microtiter wells in 3% EMEM (Eagle 
Minimum Essential Medium w/o L-glutamine [Bio 
5 Whittaker Cat. No. 12-125F] , with fetal bovine serum 
[FBS; which had been heat inactivated for 3 0 minutes 
at 56°C; Bio Whittaker Cat. No. 14-501F) supplemented 
at 3%, antibiotics/ant imycotics (Gibco BRL Life 
Technologies Cat. No. 15040-017) added at 1%, and 

10 glutamine added at 1%. 

To prepare Hep2 cells for addition to uninfected 
cells, cultures of chronically infected Hep2 cells 
were washed with DPBS (Dulbecco's Phosphate Buffered 
Saline w/o calcium or magnesium; Bio Whittaker Cat. 
No. 17-512F) and cell monolayers were removed with 
Versene (1:5000; Gibco Life Technologies Cat. No. 
15040-017) . The cells were spun 10 minutes and 
resuspended in 3% FBS. Cell counts were performed 
using a hemacytometer. Persistent cells were added to 
the uninfected CV-1W cells. 

20 The antiviral assay was conducted by, first, 

removing all media from the wells containing 
uninfected CV-1W cells, then adding peptides (at the 
dilutions described below) in 3% EMEM, and 500 
chronically HPIV3- infected Hep2 cells per well. Wells 

25 were then incubated at 37°C for 24 hours. 

On day 2, after cells in control wells were 
checked for fusion centers, media was removed from the 
wells, followed by addition, to each well, of 
approximately 50/il 0.25% Crystal Violet stain in 

3o methanol. Wells were rinsed immediately, to remove 

excess stain and were then allowed to dry. The number 
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of syncytia per well were then counted, using a 
dissecting microscope. 

Alternatively, instead of Crystal Violet 
analysis, cells were assayed with XTT, as described, 
avove, in Section 17.1. 
5 Peptides: The peptides characterized in the 

study presented herein were: 

1) Peptides 157 to 188, as shown in FIG. 29A, and 
peptides T-38 to T-40, T-42 to T-46 and T-582, as 
shown in FIG. 29B. These peptides are derived 

10 from the DPI 07 region of the HPIV3 Fl fusion 

protein (represented by HPF3 107, as shown in 
FIG. 29A) ; and 

2) Peptides 189 to 210, as shown in FIG. 30A, and T- 
269, T-626, T-383 and T-577 to T-579, as shown in 

15 FIG. 3 0B. These peptides are primarily derived 

from the DP178 region of the HPIV3 Fl fusion 
protein (represented by HPF3 178, as shown in 
FIG. 3 OA) . Peptide T-626 contains two mutated 
amino acid resides (represented by a shaded 
background) . Additionally, peptide T-577 

2 0 

represents Fl amino acids 65-100, T-578 
represents Fl amino acids 207-242 and T-579 
represents Fl amino acids 273-309. 

Each peptide was tested at 2 -fold serial 
25 dilutions ranging from 500/zg/ml to approximately 

500ng/ml. For each of the assays, a well containing 
no peptide was also used. 

18.2 RESULTS 

30 The data summa ^ized in FIGS. 29A-C and 30A-B 

represent antiviral and structural information 
obtained from peptides derived from the HPIV3 fusion 
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protein DP107-like region (FIG. 29A-C) and the HPIV3 
fusion protein DP178-like region (FIG. 30A-B) . 

As shown in FIG. 29A-B, a number of the HPIV3 
DPl07-like peptides exhibited potent levels of 
antiviral activity. These peptides include, for 
5 example, peptides T-40, T-172 to T-175, T-178, T-184 
and T-185. 

CD analysis reveals that a number of the peptides 
exhibit detectable to substantial lfevel of helical 
structure. The CD spectra for one of the peptides, 
10 184, which exhibits substantial helicity is summarized 
in FIG. 29C. 

The results summarized in FIG. 3 0A-B demonstrate 
that a number of the DP178-like peptides tested 
exhibit a range of ant i- viral activity. These 
peptides include, for example, peptides 194 to 211, as 
evidenced by their low IC 50 values. In fact, peptides 
201 to 2 05 exhibit IC S0 values in the nanogram/ml 
range. In addition, many of the DP178-like peptides 
exhibited some level of helicity. 

Thus, the computer assisted searches described, 
20 hereinabove, have successfully identified viral 

peptide domains that represent highly promising anti- 
HPIV3 antiviral compounds. 



19. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 
DP178/DP107 ANALOGS IN SIMIAN 
25 IMMUNODEFICIENCY VIRUS 

FIG. 31 represents search results for SIV isolate 

MM251 (PC/Gene® protein sequence PENV_SIVM2) . Both 

107x178x4 and ALLMOTI5 search motifs identified two 

regions with similarities to DP107 and/or DP178. 

30 The peptide regions found by 107x178x4 were 

located at amino acid residues 156-215 and 277-289. 
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The peptide regions f ound by ALLMOTI5 were located at 
amino acid residues 156-219 and 245-286. Both motifs, 
therefore, identify similar regions. 

Interestingly, the first SIV peptide region 
( i.e. , from amino acid residue 156 to approximately 
5 amino acid residue 219) correlates with a DP107 

region, while the second region identified ( i.e. , from 
approximately amino acid residue 245 to approximately 
amino acid residue 289) correlates with the DP178 
region of HIV. In fact, an alignment of SIV isolate 

10 MM251 and HIV isolate BRU, followed by a selection of 
the best peptide matches for HIV DP107 and DP178, 
reveals that the best matches are found within the 
peptide regions identified by the 107x178x4 and 
ALLMOTI 5 search motifs. 

^ It should be noted that a potential coiled-coil 

region at amino acid residues 242-282 is predicted by 
the Lupas program. This is similar to the observation 
in HIV in which the coiled-coil is predicted by the 
Lupas program to be in the DP178 rather than in the 
DP107 region. It is possible, therefore, that SIV may 

20 be similar to HIV in that it may contain a coiled-coil 
structure in the DP107 region, despite such a 
structure being missed by the Lupas algorithm. 
Likewise, it may be that the region corresponding to a 
DP178 analog in SIV may exhibit an undefined 

25 structure, despite the Lupas program's prediction of a 
coiled-coil structure. 

20. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 
DP178/DP107 ANALOGS IN EPSTEIN- BARR 
VIRUS 

30 The results presented herein describe the 

identification of DP178/DP107 analogs within two 
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different Epstein-Barr Virus proteins. Epstein-Barr 
is a human herpes virus which is the causative agent 
of, for example, infectious mononucleosis (IM) , and is 
also associated with nasopharyngeal carcinomas (NPC) , 
Burkitt's lymphoma and other diseases. The virus 
5 predominantly exists in the latent form and is 
activated by a variety of stimuli. 

FIG. 32 depicts the search motif results for the 
Epstein-Barr Virus (Strain B95-8; PC/Gene® protein 
sequence PVGLB_EBV) glycoprotein gpllO precursor 
10 (gpH5) . The 107x178x4 motif identified two regions 
of interest, namely the regions covered by amino acid 
residues 95-122 and 631-658. One PZIP region was 
identified at amino acid residue 732-752 which is most 
likely a cytoplasmic region of the protein. The Lupas 
algorithm predicts a coiled-coil structure for amino 

15 

acids 657-684. No ALLMOTI5 regions were identified. 

FIG. 33 depicts the search motif results for the 
Zebra (or EB1) trans-activator protein (BZLF1) of the 
above- identified Epstein-Barr virus. This protein is 
a transcription factor which represents the primary 

20 mediator of viral reactivation. It is a member of the 
b-ZIP family of transcription factors and shares 
significant homology with the basic DNA-binding and 
dimerization domains of the cellular oncogenes c-fos 
and C/EBP. The Zebra protein functions as a 

25 homodimer. 

Search results domonstrate that the Zebra protein 
exhibits a single region which is predicted to be 
either of DP107 or DP178 similarity, and is found 
between the known DNA binding and dimerization regions 

3q of the protein. Specifically, this region is located 
at amino acid residues 193-220, as shown in FIG. 33. 
The Lupas program predicted no coiled-coil regions. 
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21. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 

DP178/DP107 ANALOGS IN MEASLES VIRUS 

FIG. 34 illustrates the motif search results for 
the fusion protein Fl of measles virus, strain 
Edmonston (PC Gene® protein sequence PVGLF_MEASE) , 
5 successfully identifying DP178/DP107 analogs. 

The 107x178x4 motif identifies a single region at 
amino acid residues 228-262. The ALLMOTI5 search 
motif identifies three regions, including amino acid 
residues 116-184, 228-269 and 452-500. Three regions 
^ containing proline residues followed by a leucine 

zipper- like sequence were found beginning at proline 
residues 214, 286 and 451. 

The Lupas program identified two regions it 
predicted had potential for coiled-coil structure, 
which include amino acid residues 141-172 and 444-483. 

15 

22. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 

DP178/DP107 ANALOGS IN HEPATITIS B 
VIRUS 

FIG. 35 depicts the results of a PZIP motif 

search conducted on the Hepatitis B virus subtype AYW. 

20 Two regions of interest within the major surface 
antigen precursor S protein were identified. The 
first lies just C- terminal to the proposed fusion 
peptide of the major surface antigen (Hbs) which is 
found at amino acid residues 174-191. The second 

25 region is located at amino acid residues 233-267, The 
Lupas program predicts no coiled-coil repeat regions. 

In order to test the potential anti-HBV antiviral 
activity of these D178/DP107 analog regions, peptides 
derived from area around the analog regions are 
synthesized, as shown in FIG. 52A-B. These peptides 

3 0 

represent one amino acid peptide "walks" through the 
putative DP178/DP107 analog regions. The peptides are 
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synthesized according to standard Fmoc chemistry on 
Rinkamide MBHA resins to provide for carboxy terminal 
blockade (Chang, CD. and Meinhofer, J., 1978, Int. J. 
Pept. Protein Res. 11:246-249; Fields, G.B . and Noble, 
R.L., 1990, Int. J. Pept. Protein Res. 35:161-214). 
5 Follwing complete synthesis, the peptide amino- 

terminus is blocked through automated acetylation and 
the peptide is cleaved with trif luoroacetic acid (TFA) 
and the appropriate scavengers (King, D.S. et al., 
1990, Int. J. Pept. Res. 36:255-266). After cleavage, 
10 the peptide is precipitated with ether and dried under 
vacuum for 24 hours. 

The anti-HBV activity of the peptides is tested 
by utilizing standard assays to determine the test 
peptide concentration required to cause an acceptable 
( e.g. , 90%) decrease in the amount of viral progeny 
formed by cells exposed to an HBV viral inoculiftn. 
Candidate antivial peptides are further characterized 
in model systems such as wood chuck tissue culture and 
animal sytems, prior to testing on humans. 

20 23. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF 

DP178/DP107 ANALOGS IN SIMIAN MAS ON - 
PFIZER MONKEY VIRUS 

The results depicted herein illustrate the 

results of search motifs conducted on the Simian 

Mason-Pfizer monkey virus. The motifs reveal 

25 DP178/DP107 analogs within the enveloped (TM) protein 
GP2 0, as shown in FIG. 36. 

The 107x178x4 motifs identifies a region at amino 
acid residues 422-470. The ALLMOTI5 finds a region at 
amino acid residues 408-474. The Lupas program 

3 0 predicted a coiled-coil structure a amino acids 424- 
459. 
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24. EXAMPLE: COMPUTER -ASSISTED IDENTIFICATION OF 
DP178/DP107 ANALOGS IN BACTERIAL 
PROTEINS 

The results presented herein demonstrate the 
identification of DP178/DP107 analogs corresponding 
to sequences present in proteins of a variety of 
bacterial species. 

FIG. 37 depicts the search motif results for the 
Pseudomonas aeruginosa f imbrial protein (Pilin) . Two 
regions were identified by motifs 107x178x4 and 
ALLMOTI5. The regions located at amino acid residues 
10 30-67 and 80-144 were identified by the 107x178x4 

motif. The regions at amino acid residues 3 0-68 and 
80-125 were identified by the ALLM0TI5. 

FIG. 3 8 depicts the search motif results for the. 
Pseudomonas gonorrhoeae f imbrial protein (Pilin) . A 
15 single region was identified by both the 107x178x4 and 
the ALLMOTI5 motifs. The region located at amino acid 
residues 66-97 was identified by the 107x178x4 motif. 
The region located at amino acid residues 66-125 were 
identified by the ALLMOTI5 search motif. No coiled- 
coil regions were predicted by the Lupas program. 

2 0 

FIG. 3 9 depicts the search motif results for the 
Hemophilus Influenza f imbrial protein (Pilin) . A 
single region was identified by both the 107x178x4 and 
the ALLM0TI5 motifs. The region located at amino acid 
residues 102-129 was identified by the 107x178x4 
25 motif. The region located at amino acid residues 102- 
148 were identified by the ALLMOTI5 search motif. No 
coiled- coil regions were predicted by the Lupas 
program. 

FIG. 40 depicts the search motif results for the 
30 Staphylococcus aureus toxic shock syndrome Hemophilus 
Influenza f imbrial protein (Pilin) . A single region 
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was identified by both the 107x178x4 and the ALLMOTI5 
motifs. The region located at amino acid residues 
102-129 was identified by the 107x178x4 motif. The 
region located at amino acid residues 102-148 were 
identified by the ALLMOTI5 search motif. No coiled- 
5 coil regions were predicted by the Lupas program. 

FIG. 41 summarizes the motif search results 
conducted on the Staphylococcus aureus enterotoxin 
Type E protein. These results demonstrate the 
successful identification of DP178/DP107 analogs 
!0 corresponding to peptide sequences within this 
protein, as described below. 

The ALLMOTI5 motif identified a region at amino 
acid residues 22-27. The 107x178x4 motif identified 
two regions, with the first at amino acid residues 26- 
69 and the second at 88-115. A P12LZIPC motif search 

15 

identified two regions, at amino acid residues 1-63-181 
and 230-250. 

The Lupas program predicted a region with a high 
propensity for coiling at amino acid residues 25-54. 
This sequence is completely contained within the first 
20 region identified by both ALLMOTI5 and 107x178x4 
motifs . 

FIG. 42 depicts the search motif results 
conducted on a second Staphylococcus aureus toxin, 
enterotoxin A. Two regions were identified by the 
25 ALLM0TI5 motif, at amino acid residues 22-70 and amino 
acid residues 164-205. The 107x178x4 motif found two 
regions, the first at amino acid residues 26-69 and 
the second at amino acid residues 165-192. A P23LZIPC 
motif search revealed a region at amino acid residues 
216-250. No coiled-coil regions were predicted by the 

30 

Lupas program. 
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FIG. 43 shows the motif search results conducted 
on the E. coli heat labile enterotoxin A protein, 
demonstrating that identification of DP178/DP107 
analogs corresponding to peptides located within this 
protein. Two regions were identified by the ALLMOTI5 
5 motif, with the first residing at amino acid residues 
55-115, and the second residing at amino acid residues 
216-254. The 107x178x4 motif identified a single 
region at amino acid residues 78-105. No coiled-coil 
regions were predicted by the Lupas program. 

10 

25. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF 
DP178/DP107 ANALOGS WITHIN VARIOUS 
HUMAN PROTEINS 

The results presented herein demonstrate the 
identification of DP178/DP107 analogs corresponding to 
15 peptide sequences present within several different 
human proteins. 

FIG. 44 illustrates the search motif results 
conducted on the human c-fos oncoprotein. The 
ALLMOTI5 motif identified a single region at amino 
acid residues 155-193. The 107x178x4 motif identified 

20 

one region at amino acid residues 162-193. The Lupas 
program predicted a region at amino acid residues 14 8- 
2 01 to have coiled-coil structure. 

FIG. 45 illustrates the search motif results 
conducted on the human lupus KU autoantigen protein 

25 P70. The ALLMOTI5 motif identified a single region at 
amino acid residues 229-280. The 107x178x4 motif 
identified one region at amino acid residues 235-292. 
The Lupas program predicted a region at amino acid 
residues 232-267 to have coiled-coil structure. 

30 FIG. 46 illustrates the search motif results 

conducted on the human zinc finger protein 10. The 
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ALLMOTI5 motif identified a single region at amino 
acid residues 29-81. The 107x178x4 motif identified 
one region at amino acid residues 29-56. A P23LZIPC 
motif search found a single region at amino acid 
residues 420-457. The Lupas program predicted no 
5 coiled-coil regions. 

26. EXAMPLE: POTENTIAL MEASLES VIRUS DP178/DP107 
ANALOGS: CD AND ANTIVIRAL 
CHARACTER I Z AT I ON 

In the Example presented herein, measles (MeV) 

virus DP178-like peptides identified by utilizing the 

computer-assisted search motifs described in the 

Examples presented in Sections 9 and 21, above, are 

tested for ant i -MeV activity. Additionally, circular 

dichroism (CD) structural analyses are conducted on 

the peptides, as discussed below. It is demonstrated 

that several of the identified peptides exhibit "potent 

antiviral capability. Additionally, it is shown that 

none of the these peptides exhibit a substantial 

helical character. 

20 

26.1 MATERIALS AND METHODS 
Structural analyses : The CD spectra were 
measured in a lOmM sodium phosphate, 150mM sodium 
chloride, pH 7.0, buffer at approximately lOmM 
concentrations, using a 1 cm pathlength cell on a 
25 Jobin/Yvon Autodichrograph Mark V CD 

spectrophotometer. Peptide concentrations were 
determined from A^ using Edlehoch's method (1967, 
Biochemistry 6:1948) . 

Anti-MeV antiviral activity syncytial reduction 
30 assay: The assay utilized herein tested the ability 
of the peptides to disrupt the ability of Vero cells 



10 



15 
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acutely infected with MeV ( i.e. . cells which are 
infected with a multiplicity of infection of 2-3) to 
fuse and cause syncytial formation on a monolayer of 
an uninfected line of Vero cells. The more potent the 
peptide, the lower the observed level of fusion, the 
5 greater the antiviral activity of the peptide. 

Uninfected confluent monolayers of Vero cells 
were grown in microtiter wells in 10% FBS EMEM (Eagle 
Minimum Essential Medium w/o L-glutamine [Bio 
Whittaker Cat. No. 12-125F] , with fetal bovine serum 
10 [FBS ; which had been heat inactivated for 30 minutes 
at 56°C; Bio Whittaker Cat. No. 14 -50 IF) supplemented 
at 10%, antibiotics/antimycotics (Bio Whittaker Cat. 
No. 17-602E) added at 1%, and glutamine added at 1%. 

To prepare acutely infected Vero cells for 
addition to the uninfected cells, cultures of acutely 

15 

infected Vero cells were washed twice with HBSS* (Bio 
Whittaker Cat. No. 10-543F) and cell monolayers were 
removed with trypsin (Bio Whittaker Cat. No. 17-161E) . 
Once cells detached, media was added, any remaining 
clumps of cells were dispersed, and hemacytometer cell 

20 counts were performed. 

- The antiviral assay was conducted by, first, 
removing all media from the wells containing 
uninfected Vero cells, then adding peptides (at the 
dilutions described below) in 10% FBS EMEM, and 50-100 

25 acutely MeV-infected Vero cells per well. Wells were 
then incubated at 37°C for a maximum of 18 hours. 

On day 2, after cells in control wells were 
checked for fusion centers, media was removed from the 
wells, followed by addition, to each well, of 
approximately 50/zl 0.25% Crystal Violet stain in 
methanol. Wells were rinsed twice with water 
immediately, to remove excess stain and were then 
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allowed to dry. The number of syncytia per well were 
then counted, using a dissecting microscope. 

Anti-MeV antiviral activity plague reduction 
assay : The assay utilized herein tested the ability 
of the peptides to disrupt the ability of MeV to 
5 infect permissive, uninfected Vero cells, leading to 
the infected cells' fusing with uninfected cells to 
produce syncytia. The lower the observed level of 
syncytial formation, the greater the antiviral 
activity of the peptide. 

10 Monolayers of uninfected Vero cells are grown as 

described above. 

The antiviral assay was conducted by, first, 
removing all media from the wells containing 
uninfected Vero cells, then adding peptides (at the 
dilutions described below) in 10% FBS EMEM, and MeV 
stock virus at a final concentration of 3 0 plaque 
forming units (PFU) per well. Wells were then 
incubated at 37°C for a minimum of 36 hours and a 
maximum of 48 hours. 

On day 2, after cells in control wells were 

20 checked for fusion centers, media was removed from the 
wells, followed by addition, to each well, of 
approximately 50/xl 0.25% Crystal Violet stain in 
methanol. Wells were rinsed twice with water 
immediately, to remove excess stain and were then 

25 allowed to dry. The number of syncytia per well were 
then counted, using a dissecting microscope. 

Peptides: The peptides characterized in the 
study presented herein were peptides T-252A0 to T- 
256A0, T-257B1/C1, and T-258B1 to T-265B0, and T-266A0 

3o to T-268A0, as shown in FIG. 47. These peptides 

represent a walk through the DPl78-like region of the 
MeV fusion protein. 
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Each peptide was tested at 2-fold serial 
dilutions ranging from 100/zg/ml to approximately 
lOOng/ml. For each of the assays, a well containing 
no peptide was also used. 

26.2 RESULTS 
The data summarized in FIG. 47 represents 
antiviral and structural information obtained via 
"peptide walks" through the DP178-like region of the 
MeV fusion protein. 

As shown in FIG. 47, the MeV DP178-like peptides 
exhibited a range of antiviral activity as crude 
peptides. Several of these peptides were chosen for 
purification and further antiviral characterization. 
The IC 50 values for such peptides were determined, as 
shown in FIG. 47, and ranged from 1.35/ig/ml (T- 
257B1/C1) to 0.072/ig/ml (T-265B1) . None of the DP178- 
like peptides showed, by CD analysis, a detectable 
level of helicity. 

Thus, the computer assisted searches described, 
hereinabove, as in for example, the Example presented 
in Section 9, for example, successfully identified 
viral peptide domains that represent highly promising 
anti-MeV antiviral compounds. 

27. EXAMPLE: POTENTIAL SIV DP178/DP107 ANALOGS: 
ANTIVIRAL CHARACTERIZATION 

In the Example presented herein, simian 

immunodeficiency virus (SIV) DPl78-like peptides 

identified by utilizing the computer-assisted search 

motifs described in the Examples presented in Sections 

9, 12 and 19, above, were tested for anti-SIV 

activity. It is demonstrated that several of the 
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identified peptides exhibit potent antiviral 
capability. 

27.1 MATERIALS AND METHODS 
Anti-SIV antiviral assays : The assay utilized 
5 herein were as reported in Langolis et al. (Langolis, 
A.J. et al., 1991, AIDS Research and Human 
Retroviruses 7:713-720). 

Peptides : The peptides characterized in the 
study presented herein were peptides T-3 91 to T-400, 
10 as shown in FIG. 48. These peptides represent a walk 
through the DP178-like region of the SIV TM protein. 

Each peptide was tested at 2 -fold serial 
dilutions ranging from 100ptg/ml to approximately 
lOOng/ml. For each of the assays, a well containing 
no peptide was also used. 

15 

27.2 RESULTS 
The data summarized in FIG. 48 represents 
antiviral information obtained via "peptide walks" 
through the DP178-like region of the SIV TM protein. 
20 As shown in FIG. 48, peptides T-391 to T-400 were 

tested and exhibited a potent antiviral activity as 
crude peptides. 

Thus, the computer assisted searches described, 
hereinabove, as in for example, the Example presented 
25 in Section 9, for example, successfully identified 

viral peptide domains that represent highly promising 
anti-SIV antiviral compounds. 

28. EXAMPLE: ANTI-VIRAL ACTIVITY OF DP107 AND DP- 
178 PEPTIDE TRUNCATIONS AND MUTATIONS 

30 The Example presented in this Section represents 

a study of the antiviral activity of DP107 and DP178 
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truncations and mutations. It is demonstrated that 
several of these DP107 and DP178 modified peptides 
exhibit substantial antiviral activity. 

28.1 MATERIALS AND METHODS 
5 Anti-HIV assays : The antiviral assays performed 

were as those described, above, in Section 6.1. 
Assays utilized HIV-l/IIlb and/or HIV-2 NIHZ isolates. 
Purified peptides were used, unless otherwise noted in 
FIGS. 49A-C. 

10 Peptides : The peptides characterized in the 

study presented herein were: 

1) FIGS. 49A-C present peptides derived from 
the region around and containing the DP178 
region of the HIV-1 BRU isolate. 
Specifically, this region spanned from gp41 
amino acid residue 615 to amino acid residue 
717. The peptides listed contain 
truncations of this region and/or mutations 
which vary from the DPI 7 8 sequence amino 
acid sequence. Further, certain of the 
peptides have had amino- and/or carboxy- 
terminal groups either added or removed, 
as indicated in the figures; and 

2) FIG. 50. presents peptides which represent 
truncations of DP107 and/or the gp41 region 

25 surrounding the DP107 amino acid sequence of 

HIV-l BRU isolate. Certain of the peptides 
are unblocked or biotinylated, as indicated 
in the figure. 
Blocked peptides contained an acyl N-terminus and 
an amido C-terminus. 

30 
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28.2 RESULTS 
Anti-HIV antiviral data was obtained with the 
group 1 DP178 -derived peptides listed in FIG. 49A-C. 
The full-length, non-mutant DP178 peptide (referred to 
in FIG. 49A-C as T20) results shown are for 4ng/ml. 
5 In FIG. 49A, a number of the DP178 truncations 

exhibited a high level of antiviral activity, as 
evidenced by their low IC 50 values. These include, for 
example, test peptides T-50, T-624/ T-636 to T-641, T- 
645 to T-650, T-652 to T-654 and T-656. T-50 

10 represents a test peptide which contains a point 
mutation, as indicated by the residue's shaded 
background. The HIV- l- derived test peptides exhibited 
a distinct strain-specific antiviral activity, in that 
none of the peptides tested on the HIV- 2 NIHZ isolate 

^ demonstrated appreciable antti-HIV-2 antiviral 
activity. 

Among the peptides listed in FIG. 49B, are test 
peptides representing the amino (T-4) and carboxy (T- 
3) terminal halves of DP178 were tested. The amino 
terminal peptide was not active (IC 50 >400^g/ml) whereas 

20 the carboxy terminal peptide showed potent antiviral 
activity (IC 50 = 3/ig/ml) . A number of additional test 
peptides also exhibited a high level of antiviral 
activity. These included, for example, T-61/T-102, T- 
217 to T-221, T-235, T-381, T-677, T-377, T-590, T- 

25 378, T-591, T-271 to T-272, T-611, T-222 to T-223 and 
T-60/T-224. Certain of the antiviral peptides contain 
point mutations and/or amino acid residue additions 
which vary from the DP178 amino acid sequence. 

In FIG. 49C, point mutations and/or amino and/or 
carboxy- terminal modifications are introduced into the 

30 

DP178 amino acid sequence itself. As shown in the 
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figure, the majority of the test peptides listed 
exhibit potent antiviral activity. 

Truncations of the DP107 peptide (referred to in 
IG. 50 as T21) were also produced and tested, as shown 
in FIG. 50. FIG. 50 also presents data concerning 
5 blocked and unblocked peptides which contain 

additional amino acid residues from the gp41 region in 
which the DP107 sequence resides. Most of these 
peptides showed antiviral activity, as evidenced by 
their low IC 50 values. 
. xo Thus, the results presented in this Section 

demonstrate that not only do the full length DP107 and 
DP178 peptides exhibit potent antiviral activity, but 
truncations and/or mutant versions of these peptides 
can also possess substantial antiviral character. 

15 

29: EXAMPLE: POTENTIAL EPSTEIN-BARR DP178/DP1.07 

ANALOGS: ANTIVIRAL CHARACTERIZATION 

In the Example presented herein, peptides derived 
from the Epstein-Barr (EBV) DP-178/DP107 analog region 
of the Zebra protein identified, above, in the Example 
20 presented in Section 20 are described and tested for 
anti-EBV activity. It is demonstrated that among 
these peptides are ones which exhibit potential anti- 
viral activity. 



29.1 MATERIALS AND METHODS 

25 

Electrophoretic Mobility Shift Assays (EMSA) : 

Briefly, an EBV Zebra protein was synthesized 

utilizing SPG RNA polymerase in vitro transcription 

and wheat germ in vitro translation systems (Promega 

Corporation recommendations,* Butler, E.T. and 

30 Chamberlain, M.J., 1984, J. Biol. Chem. 257:5772; 

Pelham, H.R.B. and Jackson, R.J., 1976, Eur. J. 
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Biochem. 67:247) . The in vitro translated Zebra 
protein was then preincubated with increasing amounts 
of peptide up to 250 ng/ml prior to the addition of 
10,000 to 20,000 c.p.m. of a 32 P-labeled Zebra response 
element DNA fragment . After a 20 minute incubation in' 
5 the presence of the response element, the reaction was 
analyzed on a 4% non- denaturing polyacrylamide gel, 
followed by autoradiography, utilizing standard gel- 
shift procedures. The ability of a test peptide to 
prevent Zebra homodiraer DNA binding was assayed by the 

10 peptide's ability to abolish the response element gel 
migration retardation characteristic of a protein- 
bound nucleic acid molecule. 

Peptides: The peptides characterized in this 
study represent peptide walks through the region 
containing, and flanked on both sides by, the 
DP178/DP107 analog region identified in the Example 
presented in Section 20, above, and shown as shown in 
FIG. 33. Specifically, the peptide walks covered the 
region from amino acid residue 173 to amino acid 
residue 246 of the EBV Zebra protein. 

20 Each of the tested peptides were analyzed at a 

range of concentrations, with 150ng/ml being the 
lowest concentration at which any of the peptides 
exerted an inhibitory effect. 



25 29.2 RESULTS 

The EBV Zebra protein transcription factor 
contains a DP178/DP107 analog region, as demonstrated 
in the Example presented, above, in Section 20. This 
protein appears to be the primary factor responsible 

3Q for the reactivation capability of the virus, A 

method by which the DNA-binding function of the Zebra 
virus may be abolished may, therefore, represent an 
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effective antiviral technique. In order to identify 
potential anti-EBV DP178/DP107 peptides, therefore, 
peptides derived from the region identified in Section 
20, above, were tested for their ability to inhibit 
Zebra protein DNA binding. 
5 The test peptides 1 ability to inhibit Zebra 

protein DNA binding was assayed via the EMSA assays 
described, above, in Section 28.1. The data 
summarized in FIG. 51A-B presents the results of EMSA 
assays of the listed EBV test peptides. These 

10 peptides represent one amino acid "walks" through the 
region containing, and flanked on both sides by, the 
DP178/DP107 analog region identified in the Example 
presented in Section 20, above, and shown as shown in 
FIG. 33. As shown in FIG. 51A-B, the region from 
which these peptides are derived lies from EBV Zebra 
protein amino acid residue 173 to 246. A number of 
the test peptides which were assayed exhibited an 
ability to inhibit Zebra protein homodimer DNA 
binding, including 439, 441, 444 and 445. 

Those peptides which exhibit an ability to 

20 inhibit Zebra protein DNA binding represent potential 
anti-EBV antiviral compounds whose ability to inhibit 
EBV infection can be further characterized. 

30. EXAMPLE: IDENTIFICATION OF RSV DP107/DP178 
ANALOGS WITH REDUCED BINDING 
25 AFFINITY 

In the example presented herein, peptides derived 

from the RSV DP178 analog T112 are described and 

tested for binding affinity to the DP107-like domain 

of the RSV Fl -protein. Particular peptides are 

30 identified that have a reduced binding affinity for 

their DP107-like target, and key amino acid residues 
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are identified the confer high binding affinity to the 
native peptide (i.e., to T112) . Such peptides are 
useful, e.g., in screening assays such as those 
described above in Section 5,6.1 to identify compounds 
which inhibit or disrupt the interaction between DP107 
5 and DP178, and in providing guidance for generation of 
additional peptides exhibiting reduced affinity 
binding . 

3 0.1 MATERIALS AND METHODS 
10 A maltose binding fusion protein of the RSV Fl- 

protein (MF5.1) was constructed using methods similar 
to those described in Section 8.1.2, supra, for 
construction of the M41 fusion protein. Specifically, 
the DNA sequence corresponding amino acid residues 
142-302 of the RSV Fl protein was amplified by PCR and 

15 

cloned into the Xmn I site of the expression vector 
pMal-p2 (New England Biolab) to give MF5.1. These 
amino acid residues correspond to the extracellular 
domain of the RSV Fl protein including its DP10 7 
region but excluding the DP178 region. 

2 0 

The peptides characterized in the study presented 
herein were: T122, T800, T801, T802, T803, T804, 
T805, T806, T807, T808, T809, T810, T811, T1669, 
T1670, T1671, T1672, T1673, T1680, T1681, T1682, T1683 
and T1684, as shown in FIG. 53. T112 represents the 

25 DP178-like region of the RSV Fl protein. The other 
peptides characterized are modified DP178 proteins 
derived from T112 . 

Cell fusion assays were performed with each of 
the peptides as described in Section 17 above. The 

3Q binding affinity of each peptide was also measured in 
a competitive binding assay described in Section 5.6.1 
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above, wherein the concentration of each peptide 
necessary to bind to the M5.1 fusion protein (i.e., 
the B 50 value) , and thereby disrupt binding of biotin 
labeled T112 (T888) to the fusion protein, was 
measured . 

5 

30.2 RESULTS 
T112 is a 35 amino acid residue peptide that 
corresponds to amino acid residues 482-516 of the RSV 
Fl protein and has the following amino acid sequence: 

10 

VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV 

The peptide represents the DP178-like region of the 
RSV Fl protein and has substantial, antiviral activity 
against RSV as discussed in Section 17.2 above and 
shown in FIG. 28A. 

T112 analogs were generated according to at least 
three different strategies to generate peptides based 
on T112 that would still bind to the DP107-like domain 
of the RSV Fl protein but with a lower binding 
20 affinity. First, a truncated peptide was generated, 
reducing the length of the peptide from 3 5 to 2 8 amino 
acid residues. Specifically, the truncated peptide, 
which is referred to herein as T67, had the amino acid 
sequence : 

25 

DEFDASISQVNEKINQSLAFIRKSDELL 

corresponding to amino acid residues 486-213 of the Fl 
fusion protein. The binding affinity of the peptide 
30 to the DPl07-like domain of Fl protein was determined 
according to the methods described in Section 5.6.1 
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above. The truncated peptide had a binding affinity 
(5 nM) that was five times lower than that of the full 
length T112 peptide (2 nM) . 

As part of a second strategy, the peptides 
identified as T800 through T811 in PIG. 53 were 
5 synthesized to identify particular amino acids in T112 
that contribute to a larger part of that peptide's 
binding affinity. As a whole, these alanine 
substitutions represent an "alanine -scanning" type 
walk across the sequence of T112 . 

10 Each of the peptides synthesized had a change of 

three consecutive amino acid residues in the T112 
sequence to three alanine residues. Each peptide was 
tested for its ability to inhibit the binding of the 
native peptide (i.e., of T112) in a competitive 
binding assay as described in Section 5.6.1 above. 
The results are also shown in FIG. 53. In particular, 
the peptides T802, T804, T807 and T810 had 
significantly reduced affinity for the DPl07-like 
target, suggesting that the regions containing amino 
acid residues 488-490, 494-496, 503-505 and 512-514 of 

20 the RSV Fl protein (amino acid residues 7-9, 13-15, 
22-24 and 31-33, respectively, of T112) , contribute 
significantly to the high binding affinity of T112 for 
its DPl07-like target in the RSV Fl protein. 

The peptides T1669-T1673 and T1680 through T1684 

25 were then synthesized, each of which contains a single 
alanine substitution at one of the above- listed amino 
acid residue positions of T112. The binding affinity 
of these peptides for their DP107-like target can also 
be determined by means of the same routine screening 

3q assays, thereby identifying individual amino acid 
residues which affect binding affinity of T112. 
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In addition, an additional novel peptide, 
referred to as T786, was generated by modifying 
various amino acid residues in the T112 sequence which 
were identified, using standard principles of protein 
and design, as affecting properties such as binding 
5 affinity, solubility and biological stability. 
Specifically the following amino acid residue 
substitutions were made: F 2 - Y, S 2X - A, F 24 - Y and 
S 28 - A, wherein the subscript numerals indicate the 
amino acid residue position in T112 . The resultant 
10 peptide, which is referred to herein as T786, thus had 
the amino acid sequence: 

VYPSDEFDAS ISQVNEKINQALAYIRKADELLHNV 

The binding affinity of this novel peptide for 
the DP107 target was found to be 19 nM, i.e., 
approximately ten- fold less than the binding affinity 
of T112. 

The data demonstrates that peptides having a 
reduced binding affinity for a DP107 target (i.e., for 

2 0 

an HR1 domain) may be readily found by modifying a 
DP178 peptide such as T112, e.g., by means of the 
routine techniques and assays described herein. 
Further, the techniques and assays identify key amino 
acid residues which may be used to construct and 
25 identify other reduced affinity peptides. 

31. EXAMPLE: IDENTIFICATION OF HIV DP107/DP178 
ANALOGS WITH REDUCED BINDING 
AFFINITY 

In the example presented herein, peptides derived 

30 from DP178, which is also referred to as T20, are 

described and tested for binding affinity to the DP107 
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domain of the HIV gp4l. Particular peptides are 
identified that have a reduced binding affinity for 
their DP107 target, and key amino acid residues are 
identified the confer high binding affinity to the 
native peptide (i.e., to T20) . Such peptides are 
5 useful, e.g., in screening assays such as those 

described above in Section 5.6.1 to identify compounds 
which inhibit or disrupt the interaction between DP107 
and DP178. 

Specifically, the peptides identified as T813 and 
10 T868 through T878 in FIG. 53 were synthesized to 

identify particular amino acids in T20 (DP178) that 
contribute to a greater part of that peptide's binding 
affinity. Each of the peptides synthesized had a 
change of three consecutive amino acid residues in the 
T20 sequence to three alanine residues. The antiviral 
activity of each peptide was assayed in cell fusion 
assays as described in Section 6.1.3, above. The 
binding affinities of the peptides were also measured 
in a competitive binding assay described in Section 
5.6.1 above, wherein each peptides ability to disrupt 
the binding of either biotin (T83) or fluorescein 
(T1342) labeled DP178 (T20) to the M41A178 fusion 
protein described in Section 8, above, was measured. 
The binding affinity of each peptide to the peptide 
referred to as T764 
25 (GSTMGARSMTLTVQARQLLSGIVQQNNLLRAIEAQQH) also measured 
using circular dichroism to monitor the amount of 
secondary structure (i.e., helicity) adopted by the 
peptides. T764 is a peptide which represents the 
DP107 target domain of DP178 (T20) . 
3Q The results are provided in FIG. 54. In 

particular, the peptides T813, T878, T874-T876 and 
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T871 have significantly reduced affinity for the DP107 
region, suggesting the regions corresponding to the 
substituted amino acid residues in those peptides 
contribute significantly to the high binding affinity 
of T20. The peptides T1627-T1632, T1650-T1653 and 
5 T1656-T1665 were then synthesized. Each of these 

peptides contains a single alanine substitution at one 
of the amino acid residues in one of the regions 
identified as contributing significantly to the high 
binding affinity of T20. Identical assays which 

10 measured the binding affinity of these peptides 

identified four essential residues (I €46 , Q 652 , Q 653 and 
N 656/ with the subscript numerals indicating the 
residue position in the HIV-1^ gp41 amino acid 
sequence) in which alanine-substitution completely 
prevented binding to the DP107 domain, as well as five 
residues (L 641 , I 642 , l 64S , E 657 and L 663 , with the * 
subscript numerals indicating the residue) in which 
alanine-substitution position in the HIV-1^ gp41 
amino acid sequence) that reduced the binding affinity 
but did not actually block binding to the DP107 

20 domain. 

The data demonstrates that peptides having a 
reduced binding affinity for a DP107 target (i.e., for 
an HR1 domain) may be readily found by modifying a 
DP178 peptide such as T20, e.g., by means of the 
25 routine techniques and assays described herein. 

Further, the techniques and assays identify key amino 
acid residues which may be used to construct and 
identify other reduced affinity peptides. 
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The present invention is not to be limited in 
scope by the specific embodiments described which are 
intended as single illustrations of individual aspects 
of the invention, and functionally equivalent methods 
and components are within the scope of the invention. 
5 Indeed, various modifications of the invention, in 
addition to those shown and described herein will 
become apparent to those skilled in the art from the 
foregoing description and accompanying drawings. Such 
modifications are intended to fall within the scope of 
10 the appended claims. 
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WHAT IS CLAIMED IS: 

1. A method for identifying a compound that 
inhibits the formation of or disrupts a DP107/DP178 

5 complex comprising: 

(a) preparing, both in the presence and in the 
absence of a test compound, a reaction 
mixture containing a DPI 07 peptide and a 
DP178 peptide under conditions and for a 

10 time sufficient to allow formation of a 

DP107/DP178 complex; and 

(b) detecting the formation of a DP107/DP178 
complex both in the presence and in the 
absence of the test compound, 

^ wherein the formation of a DP107/DP178 complex in the 
absence, but not in the presence of the test compound 
indicates that the compound inhibits the formation of 
or disrupts a DP107/DP178 complex. 

2. The method of Claim 1, wherein the DP107 
20 peptide or the DP178 peptide is a modified DP107 or 

DP17 8 peptide. 

3 . The method of Claim 2 wherein the modified 
DP107 or DP178 peptide has a reduced binding affinity. 

25 



30 
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WKQVCHTTVP WQWNNRTTDW vNNMT *WLE AWEROISYLEGNIT 



TQLEEARAQEEKNLP* AYOKLSS* WSDFWSWv FDF A SICWLN <XLK 
»Transmcmbrane,Rcgion ♦ 

IGFLPVLGnGLRLLYTV 4 XS+ C1ARVRQGYS PLSPQUBHP WKGQPDNAEG 



PGEGGDKRKN SSEPWQKESG TAEWKSNWCK RLTNWCSISS rWLYNS 
vALLMOTI5v 

vCLTL LVHLRSAFQY IQYGLGELKA AAQEAWALA RLAQNAGYQIWLv 
ACRSAYRA IIMSPRRVRQ GLEGILN 



FIG. 22 
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Fusion 



Peptide VALLMOTI5V 



« LVS Coilcd-Coi l* 



EA.C vYYL AGVALGVATA AQITAGIALHQ 4+ SNLNAOAIO 



i&JSTSLK QSNK AIEEI RE AT Q ET VI A* YQGYQ BY.* VNNELy VP 

vALLMOTI5v 

*P6 & 12LZIPC* 

AMQHMSCELVGQRLGLRLLRYYTELLSIFGPSLRD +PISA »v EISIOALIYAL 



GGEnnOLEKLGYSGSP » MJAILESRGIK.TKI v TIIVDLPGKF IILSISY 



+P1 & I2LZIPC* 

+PTESEVKGV1VHRLEAV+ SYNIGSQEVATTVPRYIATNGYI.ISNFDESSCVFVS 



ESAICSQNSL YPMSPLLQQG IRGDTSSCAR TLVSGTMGNK F1LSKGNIVA 



NCASILCKCY STSTIINQSP DKLLTFIASD TCPLVEIDGA TIQVGGRQYP 



♦LVS Coilcd-Coil* 
v ALLMOTI5 v 
+P12&23LZIPC+ 
DMVYEGRVALG +PAISLD vRL* PVGTNLGNALKia,nDAKVLI * 



nss* 



ISv* SFN 



♦ Transmembrane Reg ion + 
4 FGSLL SVPILSCTAL AUXMYGC * 



K RRYQQTLKQH TKVDPAFKPD LTGTSKSYVR SL 



FIG. 23 



PCI7US00/35727 
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Fusion vALLMOTI5v 

Peptide +JiLZil2Sx4* 
v FIGAI IGSVALGVA TAAQITAASA LIQANQNAAN AlLRLKESITA 



TIEAVHICVTDGLSOLAVA A VGKMv QQFVNDQFNNTAQELDCIKJTQQV 



vALLMOllSv 

GVELNLYLTELTTV FGPQITSPAL vTQLTIQALYNAGGNMDYLLTKLGVG 



+P1 & 12LZIPC* 

NNQLSSLIGSGLIT GN v *PILYDSQT QLLGIQVTLP SVGNLNNMRATYLET 



LSVST TKGFASALVP KWTQVGSVI EELDTSYC1E TDLDLYCTIU VTFPMSPGIY 
SCLNGNTSAC MYSKTEGALT TPYMTLKGSV IANCKMTTCR CADPPGDSQ 



VALLMOTI5V 

NYGEAVSLID RHSCN * Y VLSLD GITLRLSGEF DATYQKN1SI LDSQVIVTG 



♦LVS Coilcd-Coil* 4 Tn»ns- 
*HLPISTELGNY HHglSMLBK T,EESNSKLP K VT WKLTSTSA 4 LET* YJA 



lion * 

LTAISLVCGn,ST.V vA LACYT.MY 4 KQKAQQKTLLWLGNNTLGQMRATrTCM 



FIG. 24 
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Fusion vALLMOTI5v 
Peptide * 1107x178x4* *LY£ 
.I<F_C_GY *1G VBALS « VATSAOITAAVALVEAKOARSDIEKLKI. 



AnHlTNICAVOSVOSSrGNLTVAIKSVO* PYVWCE v* IWSIARLGCEAAG 



vALLMOTISv 

LQLGIALTQH av YSELTNIFGDNIGSLOEKGIICLOGIASLYRTNTTE v a 



+P5 & 12LZIPC* 

IFTTSTVDKYD1YDLLFTESIKVRVIDVDLNDYSITLQVRL * PLLTRLLNTQrYR 



VDSISYNl* QNREWYU PLPSII1MTKGAFLGGADVKECIEAFSSYIC 

PSDPGFVLNHEMESCLSGNISQCPRTWKSDIVPRYAFVNGGWANCITT 

TCTCNGIG^^lINQPPDQGVKIITI^KEC^mGINGMLF^rrNKEGTLAPYTP 



vALLMOTISv 
a 110 7 x178 x 4 a 
+P6 & 23LZIPC* 

NDITLNNSVALD +PIDI + SIELN v KAKSPLEESKEWl * RRSNOKI, * 



4 Transmembrane Reg ion ♦ 
PSICNWHQSSU LIM IIILFIINVTIU HAVKYYv R 

IQKRNRVDQN DKPYVLTNK 



FIG. 25 
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Fusion 
Peptide 

GLFGAI AGFIENGWl-GMtDG WYGFR.I IQNSEGTG 



y ALLMOT15 v 
'LyS.Coilc^Coil* 



WJEKYi^DIKIIlL* WRYN AKLLVALENQIITI » DLTv DSEMNKLFEKTR 
RQLREN AEEMG N GCFKI Yl IKCDN ACIESHW GTYDI© VYRDEALNNRFQDCG 



VELK.SGYKDWILWISFAISCFLLCWLLGFIMWACQRGNIIICNICI 



FIG. 26 
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FUSION VALLMOTI5V 
PEPTIDE 4 107x178x44 
.RNKRGVFVLGFLGFLATAGSAMGAAS 4Y XXXXAOSRTLLAG1VOOOOO 



1XDVVKROOELLRLTVWGTKNLOTRVTAIEKYLKDOAOL 4NAWGY CAP 



VALLMOTtSr 

•LVS PREDICTED COILED-COIL 
RQVCI 1T1VPWPNASLTPDW *NND VTWQEWERKVDFLEENITALLEEAQIQQ 



4 107x178x4 4 

KKNMY 4 ELQKLNSWI) * VFY GNXXXXXXXXXXXXXXXXXXXXXXXXXXX 4 



. IYIVMI,AKLRQGYRPVF.SSPI'.SYFQXTIITQQDPALPTREGKEGDGGEGGGNSSWI' 
WQIEYIHF 
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MTRRRVI.SVVVLLAALACRLGAQTPEQPAPPATTVQPTATRQQTSFPFRVCELSSHGDLFRFSSD 

I QCPSP GTRENI ITEGLLMVFKDN 1 1 PYSF 4 KVR$YTKIVTNILIYNGWYADSVTNRHE 4 
EKISVUSY ETDQMDTIYQ CYNAVKMTKD GLTRVYVDRD GVNITVNLKP TGGLANGVRR 
YASQTEI.YDA PGWLIWTYRT RTTVNCLITD MMAKSNSPFO FFVTTTGQTV EMSPFYDGKN 
KETFHF.RADS FHVRTNYK1V OYDNRGTNPQ GERRAFLDKG TYTLSWKLEN RTAYCPI.QHW 
QTFDSTIATE TGKSIHFVTD EGTSSFVTNT TVGIELPDAF KCIEEQVNKT HEKYEAVQD 
RYIKGOEAIT YFITSGGLLL AWLPLTPRSL ATVKNLTELT TPTSSPPSSP SPPAPSAARG 
STPAAVI.RRR RRDAGNATTP VPPTAPGKSL GTLNNPATVQ IQFAYDSLRR QINRMLGOLA 
RAWCLEQKRQ NMVLRELTKl NPTTVMSSIY GKAVAAKRLG DVISVSQCVP VNQATVTLRK 
• SMRVPGSETM CYSRPLVSFS FINDTKTYEG QLGTDNEIFL TKKMTEVCQA TSQYYFQSGN 

4107x178x4* 

EIIIVYNDYHH FKT1ELDGIA TLQTFISLNT i SLIENIDFASLELYSRDEQRASNVFD *LE4 

*LVS PREDICTED COILED COIL* TM Potential 

GI FREYNFQAQN I AGLRKDLDNAVSN* GRNQ FVDGLGELMDSLGSVG QSITN 

*P12LZIPC* 

TM Potential TM Potential 

LVSTVGGLFSSLVSGFISF TK N *PFGGMLILVLVAGVVILVISL* TRRTRQMS 
QQPVQMLYPG IDELAQQHAS GEGPGINPIS KTELQAIMLA LHEQNQEQKR AAQRAAGPSV 
ASRALQAARORFPGLRRRRY I IDPE f AAALL GEALTEF 
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MMDPNSTSLU VKFTPDPYQV PFVQAFDQAT RVYQDLGGPS QAPLPCVLWP VLPEPLPQGO 

LIAYIIVS1AP TGSWFSAPQP APLNAYQAYA APQLFPVSDI TQNQQTNQAG GEAPQPGDNS 

TVQTAMVVF ACPGANQGQQ LADIGVPQPA PVAAPARRTR KPQQPESLEE COSELEl 

PUNA BINDING? 4 107x178X44 +D1MERIZAT10N+ 

PKRY KNRVASRKCRAK 4F1P Q + LLQHYREVAAAKSSENDRLRLLLKO * 

MCPSLDVD+ SI IPRTPDVLIIE DLLNF 
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FUSION 

PEPTIDE VAILM0TI5V * LVS COILED-COH * 

FAG VVVLAGAALGVATAAQITAGIALHQSML* NSOAIDN[ RASIFTTN 



OA I EA I RQAGQEM I *L AVOGVQDY I NNV EL I PSMNQLSCDL IGQKLGLKLLRYYT 



*P23LZIPC* 
*P6,12LZIPC* 

4 107X17 SX4* 

VALLM0TI5V 

LI! SLIGPSLRD *PISA 4V EISIQLSYALGGDINKV * LEKLGYSGGDL * ' 

*P1,12LZIPC4 

LSJ1ES* RGIKARIV TH VOTES YF I VLS I AY *PTLSEIKGVIVHRLEGV* SY 

NIGSQEWYTTVPKYVATQGYLISNFDESSCTFMPEGTVCSQNALYPMSPLLQECL 

RGSTKSCARTLVSGSFGNRFILSQGNLIANCASILCKCYTTGTIINQDPOKILTYIAA 

4-P23LZIPC* 
*P12LZIPC* 

YALLM0TI5T 

*IVS COILFP-COIi * 
DHCPVVEVNGVTIQVGSRRYPDAVYIHRIDLGP *P ¥IS *LERLDVGTNI GN 

♦ TRANSMEMBRANE REGION * 
AIAKLEPAKELL* £SSDQ1*L* RSMK ♦GLS STSIVY ILIV AVCLGGLIGIP 

AU CCC+ RGRCNKKGEQVGMSRPGLKPDLTGTSKSYVRSL 
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Pre S I and Pre S2 

MC.QNLSTSNPLGFFPDIIQI.DPAFRANTANPDWDFNPNKDTWPDANKVGAGAFG 
IXiFrPPIIGGLLGWSPQAQGILQrLPANPPPASTNRQSGRQPTPLSPPLRNTMPOAM 
QWN.srnnQTLQDPRVKGLYI-PAGGSSSGTVNPVLTlASPLSSIFSIUGDPALN 



MAJOR SURFACE ANTIGI-N(I IDs) 
FUSION 
PEPTIDE 

4-P12&23LZIPC* 

MI.-NITSG FLG *PLL VI.QAGFFELTRILTl* PQSLDSWWTSLNFLGGTTVCLG 



♦PI2&23LZIPC* 

QNSQSPTSNHSPTSCPPTC ♦PGYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGML* 
I'VCPl.IPGSSITSTGPCRTCM'FrAQGTSMYPSCCCTKPSDGNCTCIPIPSSWAFGKF 



♦ TRANSMEMBRANE REGION S 
I.WEWASARFSWLS ♦ LLVPFVOWFVGLSPTVWLSV Ii WMMWYWGPSI. 



♦ TRANSMEMBRANE REGION S 
♦ YSILSPFLPLLPIFFCLWVYI ^ 
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FUSION V ALLMOTI5 * * 107x 178x4 4 
PEPTIDE *LVS COILED COIL 

AIQI.IPI.FVG LGI riTAVSTGAAGLOVS *H * OYTKLSMOLISDV 

O AISSTIODLODOVDSLAKVVLO * NKKCLDLLTAE * QGGW 

CI.AI.QEKCCFYANKSGIVRDKIKNLQDDLERRRRQLIDNPFWTSFHG 

II .PYVMPLLGPLLCLLI .VLSFGPIIFNKLMTFIKHQIESIQAKPIQVHYI I 

TRANSMEMBRANE REGION 
RLEQEDSGGSYLTLT ?????????????????????????.... 

FIG. 36 
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MKAQKGFTLI ELMIVVAIIG ILAAIAPGQ 



4 107x178x44 
V ALLMOTI5V 

4V YODYTARTOVTRAVSEVSALKTAAESAILEGKEIVSSA 4 T¥ 



PK DTQYDJOFT 



4 107x178x44 
YALLMOT15V 

4V ESTLLDGSGKSOIOVTDNODGTVELVATLGKSSGS 4 AIKGAVITSRV 



KNDGV WNCKITKTPT AWKPNYAPAN CPKS 



FIG. 37 
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MNTLQKGFTL JIZLMIVIA1V G1LAAVALPA YQDYTARAQV 



SCAILLAI-GQ KSAVTI- YYLN HGIWP 

4 107x178x4 4 
V ALLMOTI5 V 

4V KDNTSAGVASSSSIKGKYVKEVKVENGVVTAT 4 



MNSSNVNKEIQGKKLSLWAKRQDGSVKWV 



FCGQP VTRNAKDDTV TADATGNDGK IDTKHLPSTC RDNFDAS 



FIG. 38 
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MKKTLLGSLI LLAFAGNVQA DINTETSGKV TFFGKVVENT 

CKVKTEIIK.NL SVVLNDVGKN SLSTKVNTAM PTPFTITLQN 
CDPTTANGTA NKANK.VGLYF Y 

www * 

VALLMOT15* 

4V SWKNVDKENNFTLKNEOTTADYATNVN1 * 

QLMESNGTKAISVVGKETEV 

DF MHTNNNGVAL NQTIIPNNAHI SGSTQLTTGT NELPLHFIAQ 
YYATNKATAG KVQSSVDFQI AYE 

FIG. 39 
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MNKKLLMNFF 1VSPl.U,ATT ATDFTPVP 

4 107x178x44 
VALLMOTI5V 

4V LSSNOHKTAKASTNDNIKDLLDVVYSSGSDTFTNS 4V 

KVLDNSL GSMRIKNTDG S1SMIPPSP YYSPAFTKGE KV 
4 107x178x4 4 

4 DLNTKRTKKSOHTSEGTY1HFOISCVT 4 

N TEKLPTPIEL PLKVKVMGKD SPLKYG 
♦P12LZIPC* 

* PK FDKKQL A I STLDFE1 RHQLTQ1 * 

IIGLYRSSDKT GGYWKITMND GSTYQSDLSK KFEYNTEKPP 
1N1DE1KT1E AE1N 

FIG. 40 



WO 01/51673 



PCTAJS00/35727 



¥ALLMOT15¥ 

MKKTAFILLL FIALTLTTSP L VVNG 

4 107x178x4 4 

♦LVS PREDICTED COILED-COIL* 

*S 4 KKSEEINEKDLRKKSEL0RNALSNLR01V * YYNEKAITENKESDD 4 



QFLENTLLV FKG FFTGIIPW 



4 107x178x4 4 

4 YNDLLVDLGSKDATNKYKGKKVDLYGAV 4 



YGYQCAGGTPNKTACMYGGVTLHDN NRLTEEKKVP INLWIDGKQTTV 
• *P12LZIPC4> 

4-PIDKVKTSK.KEVTVQELDL* QARHYLHGK FGLYNSDSFGGKVQ 



♦P12LZIPC* 

ROI.IVF HSSEGSTVSY DLFDAQGQY *P DTLLRIYRDN KTINSENLHI* 



DEYEYTT 

FIG. 41 
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VALLMOTI5* 

MKKTAFTLLL FIALTLTTSP L YVNGS 

4 107x178x4 4 

4 KKSEEINEKDLRKKSELOGTALGNLKOIYYYNEKAKTENKESHD 4 QV 



FLQUT1LFKG FFTDUSWYND LLVDFDSKD1 VDKYKGKKVDLYGAYY 



GYQC AGGTPNKTAC MYGGVTLIIDN NRLTEEKKVPINL WLDGKQNTV 



4 107x178x4 4 
VALLMOTI5V 
*PI2LZIPC* 

*P VL 4 ETVKTNKKNVTVOELDLOARRVL * OBKYNLYN4 



SDVFDGKVQRV GLIVF IITST1I 
*P23LZIPC* 

4-PSVNYDLFGAQGQYSNTLLRIYRDNKTINSENMIII* DIYLYTS 



FIG. 42 
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MKNI IT1FFIIXASPLYANGDRLYRADSRPPDEIKRFRSLMPRGNEYFDRGT 
YALLMOTI5V 

¥QMNINLYDHARGTQTGFVRYDDGYV 
♦ 107x 178x4 4 

4 STSLSLKSAHLAGOY1LSGYSLT1Y1VI * ANMFNVNDVISVYV 

SP HPYEQEVSAL GGIPYSQIYG WYRVNFGV1D ERLHRNREYR 

DRYYRNLNIA PAIZDOYRLAG FPPDHQAWRE EPWIHHAPQG 

CGDSSRTITG DTCNi: 
VALLMOTI5* 

YETQNLSTIYLREYQSKVKRQIFSDYQSEVD1YNRIRDEL* 



FIG. 43 
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MMFSGFNADY EASSSRCSSA SPAGDSLSYY HSPADSFSSM 

GSPVNAQDFC TDLAVSSANF IPTVTAISTS PDLQWLVQPA 

I.VSSVAPSQT RAPIIPFGVPA PSAGAYSRAG WKTMTGGRA 

♦LVS PREDICTED COILED-COIL* 
QSIGRRGKVE QLSPEEEEKR RIRRE *RNKMA AAK 

± 107x178x4* 

YALLMOTI5V 

YCRNRRREL ^ TDTLO AETD O LEDEKS A LOTEI ANLLKEKEKT ., V 
EFILAAHR* PACKIPDDL GFPEEMSVAS LDLTGGLPEV 
ATPESEEAFF LPLLNDPEPK PSVEPVKSIS SMELKTEPFD 
DFLFPASSRP SGSETARSVP DMDLSGSFYA LPLLNDPEPK 
I'SVEPVKSIS SMELKTEPFD DFLFPASSRP SGSETARSVP 
DMDLSGSFYA GSSSNEPSSD SLSSPTLLAL 

FIG. 44 



WO 01/51673 



PCT/USOO/35727 



SGWESYYKTEGDEEAEEEQEENLEASGDYKYSGRDSLIFLVDASKA 

MFESQSEDELTPFDMSIQCIQSVYISKUSSDRDLLAVVFYGTEKDKNS 

VNFKNIYVLQELDNPGAKRIl.ELDQFKGQQGQKRFQDMMGHGSDY 

SLSHVLWVCANLFSDVQFKMSHKRIMLFTNEDNPHGNDSAKASRAR 

TKAGDLRDTGIFLDLMMLKKI'GGFDISLFYRDIISIAI-DED 

± 107x178x4* 
VAEEMOTI5T 

*LVS PREDICTED COILED-COIL* 

VLRVM *FEE ±SSKLEDLLRKVRAKETRKRALSRLKLKLNKDIV* 1SV 



G1YNLVQKALV KPPPIK.LYRETN* EP V KTKTRTFNTSTGGLLLPS DTK R 

SQIYGSRQIILEKEETEELKRFDDPGLMLMGFKPLVLLKKHHLRPSLFVYPE 
ESLV1GS STLFSALLIKCLEKEVAALCRYTPRRNIPPYFVALVPQEEELDDQK 
IQVTPPGFQLVFLPFADDKRKMPFTEKIMATPEQVGKMKAIVEKLRFTYRS 
DSFKNPVLQQllFRNLEALALDLME 



♦PI2LZIPC* 

4-PEQAVDLTLPKVEAMNKRL* GSLVDEFKELVYPPDYNPEGKVTKR 

KIIDNEGSGSKRPKVEYSEEELKTHISKGTLGKFTVPMLKEACRAYGLKSG 

LKKQELLEALTKHFQD 



FIG. 45 
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GGOALSPQI ISAVTQGS11KNKEGMDAKS 



± 107x178x4 * 
VALLMOTI5V 

y^ LTAWSRTLVTFKDVFVDFTREEWKLLDT * AQQIVYRNV 

MLENYKNLVSLGYQLT* KPDVILRLEK.GEEPWLVEREIHQETHPD 

SETAFEIKSSVSSRSIFKDKQSCDIKMEGMARNDLWYLSLEEVWKCR 

DQLDKYQENPERHLRIIQLIHTGEKPYECKECGKSFSRSSHLIGHQKT 

MTGEEPYECKECGKSFSWFSHLVTHQRTHTGDKLYTCNQCGKSFVII 

SSRLIRHQRTHTGHKPYECPECGKSFRQSTHLiLHQRTHVRVRPYECN 

ECGKSYSQRSHLVVHHRIHTGLKPFECKDCGKCFSRSSHLYSHQRTH 

TGEKPYECFIDCGKSFSOSSALFVHQRfHTGEKPYECCQCGKAFIRKN 

DLIKI IQRU I VGAI-I YKCNQCGIIFSQNS 



*P23LZIPC* 

♦PFFVHQIAHTGEQFLTCGNQCGTALVNTSNLIGQTNHI* RENAY 



FIG. 46 
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